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SPECIFIC A TION 
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CONSENSUS /ANCESTRAL IMMUNOGENS 

TECHNICAL FIELD 

The present invention relates, in general, to 
an immunogen and, in particular, to an immunogen for 
5 inducing antibodies that neutralize a wide spectrum 
of HIV primary isolates and/or to an immunogen that 
induces a T cell immune response. The invention 
also relates to a method of inducing ant i -HIV 
antibodies, and/or to a method of inducing a T cell 
10 immune response, using such an immunogen. The 

invention further relates to nucleic acid sequences 
encoding the present immunogens . 

BACKGROUND 

The high level of genetic variability of HIV-1 
15 has presented a major hurdle for AIDS vaccine 

development. Genetic differences among HIV-1 groups 
M, N, and O are extensive, ranging from 30% to 50% 
in gag and env genes, respectively (Gurtler et al , 
J. Virol. 68:1581-1585 (1994), Vanden Haesevelde et 
20 al, J. Virol. 68:1586-1596 (1994), Simon et al , Nat. 
Med. 4:1032-1037 (1998), Kuiken et al, Human 
retroviruses and AIDS 2000: a compilation and 
analysis of nucleic acid and amino acid sequences 
(Theoretical Biology and Biophysics Group, Los 
25 Alamos National Laboratory, Los Alamos, New 
Mexico) ) . Viruses within group M are further 
classified into nine genetically distinct subtypes 
(A-D, F-H, J and K) (Kuiken et al, Human 
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retroviruses and AIDS 2000: a compilation and 
analysis of nucleic acid and amino acid sequences 
(Theoretical Biology and Biophysics Group, Los 
Alamos National Laboratory, Los Alamos, New Mexico, 
5 Robertson et al, Science 288:55-56 (2000), Robertson 
et al, Human retroviruses and AIDS 1999: a 
compilation and analysis of nucleic acid and amino 
acid sequences, eds . Kuiken et al (Theoretical 
Biology and Biophysics Group, Los Alamos National 

10 Laboratory, Los Alamos, New Mexico) , pp. 492-505 

(2000) ) . With the genetic variation as high as 30% 
in env genes among HIV-1 subtypes, it has been 
difficult to consistently elicit cross-subtype T and 
B cell immune responses against all HIV-1 subtypes. 

15 HIV-1 also frequently re combines among different 
subtypes to create circulating recombinant forms 
(CRFs) (Robertson et al, Science 288:55-56 (2000), 
Robertson et al , Human retroviruses and AIDS 1999: a 
compilation and analysis of nucleic acid and amino 

20 acid sequences, eds. Kuiken et al (Theoretical 

Biology and Biophysics Group, Los Alamos National 
Laboratory, Los Alamos, New Mexico) , pp. 492-505 
(2 000) , Carr et al, Human retroviruses and AIDS 
1998 : a compilation and analysis of nucleic acid and 

25 amino acid sequences, eds. Korber et al (Theoretical 
Biology and Biophysics Group, Los Alamos National 
Laboratory, Los Alamos, New Mexico) , pp. III-10-III- 
19 (1998)). Over 20% of HIV-1 isolates are 
recombinant in geographic areas where multiple 

30 subtypes are common (Robertson et al , Nature 

374:124-126 (1995), Cornelissen et al, J. virol . 
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70:8209-8212 (1996), Dowling et al , AIDS 16:1809- 
1820 (2002)), and high prevalence rates of 
recombinant viruses may further complicate the 
design of experimental HIV-1 immunogens . 
5 To overcome these challenges in AIDS vaccine 

development, the use of centralized HIV-1 genes 
(consensus, ancestral and center of the tree) to 
decrease genetic distances between candidate 
immunogens and field viral strains has been proposed 
10 (Gaschen et al , Science 2 96:2354-2360 (2002), Gao et 
al, Science 299:1517-1518 (2003), Nickle et al , 
Science 299:1515-1517 (2003)). Centralized sequences 
are located close to the root of HIV-1 phylogenetic 
trees and can significantly decrease genetic 
15 distances to contemporary field HIV-1 isolates. 

The present invention results, at least in 
part, from the results of studies designed to 
determine if centralized immnunogens can induce both 
T and B cell immune responses in animals. These 
20 studies have involved the generation of an 

artificial group M consensus env gene (CON6) , and 
construction of DNA plasmids and recombinant 
vaccinia viruses to express CON6 envelopes as 
soluble gpl20 and gpl40CF proteins. The results 
25 demonstrate that CON6 Env proteins are biologically 
functional, possess linear, conformational and 
glycan- dependent epitopes of wild- type HIV-1, and 
induce cytokine -producing T cells that recognize T 
cell epitopes of both HIV subtypes B and C. 
30 Importantly, CON6 gpl2 0 and gpl4 0CF proteins induce 
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antibodies that neutralize subsets of subtype B and 
C HIV-1 primary isolates. 

SUMMARY OF THE INVENTION 

The present invention relates to an immunogen 
5 for inducing antibodies that neutralize a wide 
spectrum of HIV primary isolates and/or to an 
immunogen that induces a T cell immune response, and 
to nucleic acid sequences encoding same. The 
invention also relates to a method of inducing anti- 
10 HIV antibodies, and/or to a method of inducing a T 
cell immune response, using such an immunogen. 

Objects and advantages of the present invention 
will be clear from the description that follows. 

BRIEF DESCRIPTION OF THE DRAWINGS 



15 Figures 1A-1D: Generation and expression of 

the group M consensus env gene (CON6) . The complete 
amino acid sequence of CON6 gpl60 is shown* 
(Fig. 1A) The five regions from the wild- type 
CRF08_BC (98CN006) env gene are indicated by 

20 underlined letters. Variable regions are indicated 
by brackets above the sequences. Potential N-liked 
glycosylation sites are highlighted with bold-faced 
letters. (Fig. IB) Constructs of CON6 gpl20 and 
gpl40CF. CON6 gpl20 and gpl40CF plasmids were 

25 engineered by introducing a stop codon after the 
gpl20 cleavage site or before the transmembrane 
domain, respectively. The gpl20/gp41 cleavage site 
and fusion domain of gp41 were deleted in the 
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gpl40CF protein. (Fig.lC) Expression of CON6 gpl20 
and gpl40CF. CONG gpl20 and gpl40CF were purified 
from the cell culture supernatants of rW- infected 
293T cells with galanthus Nivalis argarose lectin 
5 columns. Both gpl20 and gpl40CF were separated on a 
10% SDS-polyarylamide gel and stained with Commassie 
blue. (Fig. ID.) CON 6 env gene optimized based on 
codon usage for highly expressed human genes . 

Figures 2A-2E. Binding of CON6 gp!2 0 gpl4 0 CF 

10 to soluble CD4 (sCD4) and anti-Env mAbs . (Figs. 2A- 
2B) Each of the indicated mabs and sCD4 was 
covalently immobilized to a CM 5 sensor chip 
(BIAcore) and CON6 gpl20 (Fig. 2A) or gpl40CF (Fig. 
2B) (100 /xg/ml and 300 /xg/ml , respectively) were 

15 injected over each surface. Both gpl20 and gpl40CF 
proteins reacted with each anti-gp!2 0 mabs tested 
except fori 7b mab, which showed negligible binding 
to both CON6 gpl20 and gpl40CF. To determine 
induction of 17b mab binding to CON6 gpl2 0 and 

20 gpl40CF / CON6 gpl20 (Fig. 2C) or gpl40CF (Fig. 2D) 
proteins were captured (400-580 RU) on individual 
flow cells immobilized with sCD4 or mabs A32 or T8 . 
Following stabilization of each of the surface, mAb 
17b was injected and flowed over each of the 

25 immobilized flow cells. Overlay of curves show that 
the binding of mab 17b to CON6 Env proteins was 
markedly enhanced on both sCD4 and mab A3 2 surfaces 
but not on the T8 surface (Figs. 2C-2D) . To 
determine binding of CON6 gpl20 and gpl40CF to human 
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mabs in ELISA, stock solutions of 20jig/ml of mabs 
447, F39F, A32, IgGlbl2 and 2F5 on CON6 gpl20 and 
gpl40CF were tittered (Fig. 2E) . Mabs 447 (V3) , 
F39F (V3) A32 (gpl20) and IgGlbl2 (CD4 binding site) 
5 each bound to both CON6 gpl2 0 and 14 0 well, while 
2F5 (anti-gp41 ELDKWAS) only bound gpl40CF. The 
concentration at endpoint titer on gpl20 for mab 447 
and F39F binding was <0.003 pig/ml and 0.006 pig/ml , 
respectively; for mab A32 was <0.125 /ig/ml ; for 
10 IgGlbl2 was <0 . 002 /xg/ml ; and for 2F5 was 0.016 
fig /ml . 

Figures 3A and 3B. Infectivity and coreceptor 
usage of CON6 envelope. (Fig. 3A) CON6 and control 
eny plasmids were cotransf ected with HIV- 1/SG3 Aenv 

15 backbone into human 293T cells to generate Env- 

pseudovirions . Equal amounts of each pseudovirion 
(5 ng p24) were used to infect iJC53-BIj cells. The 
infectivity was determined by counting the number of 
blue cells (infectious units, IU) per microgram of 

20 p24 of pseudovirons (IU/pig p24) after staining the 
infected cells for P-gal expression. (Fig. 3B) 
Coreceptor usage of the CON6 env gene was determined 
on JC53BL cells treated with AMD3100 and/or TAK-7 99 
for 1 hr (3 7°C) then infected with equal amounts of 

25 p24 (5 ng) of each Env-pseudovirion . Infectivity in 
the control group (no blocking agent) was set as 
100%. Blocking efficiency was expressed as the 
percentage of IU from blocking experiments compared 
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to those from control cultures without blocking 
agents. Data shown are mean ± SD. 

Figure 4. Western blot analysis of multiple 
subtype Env proteins against multiple subtype 
5 antisera. Equal amount of Env proteins (100 ng) 
were separated on 10% SDS-polyacryl amide gels. 
Following electrophoresis, proteins were transferred 
to Hybond ECL nitrocellulose membranes and reacted 
with sera from HIV-1 infected patients (1:1,000) or 

10 guinea pigs immunized with CON6 gpl20 DNA prime, rW 
boost (1:1, 000) . Protein-bound antibody was probed 
with fluorescent- labeled secondary antibodies and 
the images scanned and recorded on an infrared 
imager Odyssey (Li-Cor, Lincoln, NE) . Subtypes are 

15 indicated by single- letters after Env protein and 
serum IDs. Four to six sera were tested for each 
subtype, and reaction patterns were similar among 
all sera from the same subtype. One representative 
result for each subtype serum is shown. 

20 Figure 5. T cell immune responses induced by 

C0N6 Env immunogens in mice. Splenocytes were 
isolated from individual immunized mice (5 
mice/group) . After splenocytes were stimulated in 
vitro with overlapping Env peptide pools of CON6 

25 (black column) , subtype B (hatched column) , subtype 
C (white column) , and medium (no peptide; gray 
column) , INF- Y producing cells were determined by 
the ELISPOT assay. T cell IFN-y responses induced 
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by either CON6 gpl2 0 or gpl4 0CF were compared to 
those induced by subtype specific Env immunogens 
( JRFL and 96ZM651) - Total responses for each 
envelope peptide pool are expressed as SFCs per 
5 million splenocytes. The values for each column are 
the mean ± SEM of IFN-y SFCs (n=5 mice/group) . 

Figures 6A-6E. Construction of codon usage 
optimized subtype C ancestral and consensus envelope 
genes (Figs. 6A and SB, respectively). Ancestral 

10 and consensus amino acid sequences (Figs. 6C and 6D, 
respectively) were transcribed to mirror the codon 
usage of highly expressed human genes. Paired 
oligonucleotides (80-mers) overlapping by 20 bp were 
designed to contain 5' invariant sequences including 

15 the restriction enzyme sites EcoRI , Bbsl, Bam HI and 
BsmBI . Bbsl and BsmBI are Type II restriction 
enzymes that cleave outside of their recognition 
sequences. Paired oligomers were linked 
individually using PCR and primers complimentary to 

20 the 18 bp invariant sequences in a stepwise fashion, 
yielding 140bp PCR products. These were subcloned 
into pGEM-T and sequenced to confirm the absence of 
inadvertant mutations/deletions. Four individual 
pGEM-T subclones containing the proper inserts were 

25 digested and ligated together into pcDNA3 . 1 . Multi- 
fragment ligations occurred repeat ly amongst groups 
of fragments in a stepwise manner from the 5' to the 
3' end of the gene until the entire gene was 
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reconstructed in pcDNA3 . 1 . (See schematic in Fig. 
6E. ) 

Figure 7. JC53-BL cells are a derivative of 
HeLa cells that express high levels of CD4 and the 
5 HIV-1 coreceptors CCR5 and CXCR4 . They also contain 
the reporter cassettes of luciferase and 0- 
galactosidase that are each expressed from an HIV-1 
LTR. Expression of the reporter genes is dependent 
on production of HIV-1 Tat. Briefly, cells are 

10 seeded into 24 or 96 -well plates, incubated at 37 °C 
for 24 hours and treated with DEAE-Dextran at 37 °C 
for 30 minutes- Virus is serially diluted in 1% 
DMEM, added to the cells incubating in DEAE-Dextran, 
and allowed to incubate for 3 hours at 3 7 °C after 

15 which an additional cell media is added to each 

well. Following a final 48-hour incubation at 37°C, 
cells are either fixed, stained using X-Gal to 
visualize (J-galactosidase expressing blue foci or 
frozen- thawed three times to measure luciferase 

20 activity. 

Figure 8. Sequence alignment of subtype C 
ancestral and consensus env genes. Alignment of the 
subtype C ancestral (bottom line) and consensus (top 
line) env sequences showing a 95.5% sequence 
25 homology; amino acid sequence differences are 

indicated. One noted difference is the addition of a 
glycosylation site in the C ancestral env gene at 
the base of the VI loop. A plus sign indicates a 
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within-class difference of amino acid at the 
indicated position; a bar indicates a change in the 
class of amino, acid. Potential N-glycosylation sites 
are marked in blue. The position of truncation for 
the gpl4 0 gene is also shown. 

Figure 9 . Expression of subtype C ancestral 
and consensus envelopes in 2 93T cells. Plasmids 
containing codon-opt imized gpl60, gpl40, or gpl20 
subtype C ancestral and consensus genes were 
transfected into 293T cells, and protein expression 
was examined by Western Blot analysis of cell 
lysates. 48-hours post -transfect ion, cell lysates 
were collected, total protein content determined by 
the BCA protein assay, and 2 ng of total protein was 
loaded per lane on a 4-20% SDS-PAGE gel. Proteins 
were transferred to a PVDF membrane and probed with 
HIV-1 plasma from a subtype C infected patient. 

Figures 10A and 10B. Fig. 10A. Trans 
complementation of env-def icient HIV-1 with codon- 
opt imized subtype C ancestral and consensus gpl60 
and gp!40. Plasmids containing codon-optimized, 
subtype C ancestral or consensus gp!60 or gpl40 
genes were co- transfected into 293T cells with an 
HIV-l/SG3Aenv provirus. 4 8 hours post -transfect ion 
cell supernatants containing pseudotyped virus were 
harvested, clarified by centrif ugation, filtered 
through at 0.2|aM filter, and pelleted through a 20% 
sucrose cushion. Quantification of p24 in each 
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virus pellet was determined using the Coulter HIV-1 
p24 antigen assay; 25ng of p24 was loaded per lane 
on a 4-20% SDS-PAGE gel for particles containing a 
codon-optimized envelope. 250ng of p24 was loaded 
per lane for particles generated by co- transf ection 
of a rev-dependent wild-type subtype C 96ZAM6 51env 
gene. Differences in the amount of p24 loaded per 
lane were necessary to ensure visualization of the 
rev- dependent envelopes by Western Blot. Proteins 
were transferred to a PVDF membrane and probed with 
pooled plasma from HIV-1 subtype B and subtype C 
infected individuals. Fig. 10B. Infectivity of 
virus particles containing subtype C ancestral and 
consensus envelope glycoproteins. Infectivity of 
pseudo typed virus containing ancestral or consensus 
gp!60 or gpl40 envelope was determined using the 
JC53-BIj assay. Sucrose cushion purified virus 
particles were assayed by the Coulter p24 antigen 
assay, and 5-fold serial dilutions of each pellet 
were incubated with DEAE-Dextran treated JC53-BL 
cells. Following a 48-hour incubation period, cells 
were fixed and stained to visualize p-galactosidase 
expressing cells. Infectivity is represented as 
infectious units per ng of p24 to normalize for 
differences in the concentration of the input 
pseudovirions . 

Figure 11. Co-receptor usage of subtype C 
ancestral and consensus envelopes. Pseudotyped 
particles containing ancestral or consensus envelope 



were incubated with DEAE-Dextran treated JC53-BL 
cells in the presence of AMD3100 (a specific 
inhibitor of CXCR4) , TAK779 (a specific inhibitor of 
CCR5) , or AMD3 0 00+TAK77 9 to determine co-receptor 
5 usage. NL4 . 3 , an isolate known to utilize CXCR4 , 

and YU-2 , a known CCR5 -using isolate, were included 
as controls . 

Figures 12A-12C. Neutralization sensitivity of 
subtype C ancestral and consensus envelope 

10 glycoproteins. Equivalent amounts of pseudovirions 
containing the ancestral, consensus or 96ZAM651 
gpl60 envelopes (1,500 infectious units) were pre- 
incubated with a panel of plasma samples from HIV-1 
subtype C infected patients and then added to the 

15 JC53-BL cell monolayer in 96-well plates. Plates 
were cultured for two days and luciferase activity 
was measured as an indicator of viral infectivity. 
Virus infectivity is calculated by dividing the 
luciferase units (LU) produced at each concentration 

20 of antibody by the LU produced by the control 

infection. The mean 50% inhibitory concentration 
(IC 50 ) and the actual % neutralization at each 
antibody dilution are then calculated for each 
virus. The results of all luciferase experiments 

25 are confirmed by direct counting of blue foci in 
paral lei infections . 

Figures 13A-13F. Protein expression of 
consensus subtype C Gag (Fig. 13A) and Nef (Fig. 
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13B) following transfection into 293T cells. 
Consensus subtype C Gag and Nef amino acid sequences 
are set forth in Figs. 13C and 13D, respectively, 
and encoding sequences are set forth in Figs. 13E 
and 1 3 F , respectively . 

Figures 14A-14C. Figs. 14A and 14B show the 
Cons Env amino acid sequence and encoding sequence, 
respectively. Fig. 14C shows expression of Group M 
consensus CONs Env proteins using an in vitro 
transcription and translation system. 

Figures 15A and 15B. Expression of CONs env 
gene in mammalian cells. (Fig. 15A - cell lysate, 
Fig. 15B - supernatant.) 

Figures 16A and 16B. Infectivity (Fig. 16A) 
and coreceptor usage (Fig. 16B) of CON6 and CONs env 
genes . 

Figures 17A-17C. Env protein incorporation in 
CONG and CONs Env-pseudovirions . (Fig. 17A - 
lysate, Fig. 17B - supernatant, Fig. 17C pellet.) 

Figures 18A-18D. Figs. 18A and 18B show 
subtype A consensus Env amino acid sequence and 
nucleic acid sequence encoding same, respectively. 
Figs. 18C and 18D show expression of A. con env gene 
in mammalian cells (Fig. 18C - cell lysate, Fig. 18D 
- supernatant) . 
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Figures 19A-19H. M. con. gag (Fig. 19A) , 
M.con.pol (Fig. 19B) , M.con.nef (Fig. 19C) and 
C.con.pol (Fig. 19D) nucleic acid sequences and 
corresponding encoded amino acid sequences (Figs. 
19E-19H, respectively) . 

Figures 2 0A-2 0D. Subtype B consensus gag (Fig. 
20A) and env (Fig.20B) genes. Corresponding amino 
acid sequences are shown in Figs. 2 0C and 2 0D. 

Figure 21. Expression of subtype B consensus 
env and gag genes in 293T cells. Plasmids 
containing codon-optimized subtype B consensus 
gplSO, gpl40, and gag genes were transfected into 
2 93T cells, and protein expression was examined by 
Western Blot analysis of cell lysates . 48-hours 
post-transfection, cell lysates were collected, 
total protein content determined by the BCA protein 
assay, and 2 /zg of total protein was loaded per lane 
on a 4-20% SDS-PAGE gel. Proteins were transferred 
to a PVDF membrane and probed with serum from an 
HIV-1 subtype B infected individual. 

Figure 22. Co-receptor usage of subtype B 
consensus envelopes. Pseudotyped particles 
containing the subtype B consensus gpl60 Env were 
incubated with DEAE-Dextran treated JC53-BL cells in 
the presence of AMD3100 (a specific inhibitor of 
CXCR4), TAK779 (a specific inhibitor of CCR5) , and 
AMD3 000+TAK779 to determine co-receptor usage. 
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NL4.3, an isolate known to utilize CXCR4 and YU-2, a 
known CCR5-using isolate, were included as controls. 

Figures 23A and 23B. Trans complementation of 
env- deficient HIV-1 with codon-optimized subtype B 
5 consensus gpX60 and gpl40 genes. Plasmids 

containing codon-optimized, subtype B consensus 
gplGQ or gp!40 genes were co- transf ected into 2 93T 
cells with an HIV- l/SG3Aenv provirus. 4 8 -hours 
post- transf ection cell supernatants containing 

10 pseudotyped virus were harvested, clarified in a 
tabletop centrifuge, filtered through a 0.2/iM 
filter, and pellet through a 20% sucrose cushion. 
Quantification of p24 in each virus pellet was 
determined using the Coulter HIV-1 p24 antigen 

15 assay; 2 5 ng of p24 was loaded per lane on a 4-20% 
SDS-PAGE gel. Proteins were transferred to a PVDF 
membrane and probed with ant i -HIV-1 antibodies from 
infected HIV-1 subtype B patient serum. Trans 
complementation with a rev-dependent NL4 . 3 env was 

20 included for control. Figure 23B. Infectivity of 
virus particles containing the subtype B concensus 
envelope. Infectivitiy of pseudotyped virus 
containing consensus B gpl60 or gpl4 0 was determined 
using the JC53-BIj assay. Sucrose cushion purified 

25 virus particles were assayed by the Coulter p24 

antigen assay, and 5- fold serial dilutions of each 
pellet were incubated with DEAE-Dextran treated 
JC53-BL cells. Following a 48-hour incubation 
period, cells were fixed and stained to visualize P~ 
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galactosidase expressing cells. Infectivity is 
expressed as infectious units per ng of p24 . 

Figures 24A-24D. Neutralization sensitivity of 
virions containing subtype B consensus gp!60 
5 envelope. Equivalent amounts of pseudovirions 
containing the subtype B consensus or NL4 . 3 Env 
(gpl60) (1,500 infectious units) were preincubated 
with three different monoclonal neutralizing 
antibodies and a panel of plasma samples from HIV-1 

10 wubtype B infected individuals, and then added to 
the JC53-BL cell monolayer in 96-well plates. 
Plates were cultured for two days and luciferase 
activity was measured as an indicator of viral 
infectivity. Virus infectivity was calculated by 

15 dividing the luciferase units (L»U) produced at each 
concentration of antibody by the LU produced by the 
control infection. The mean 50% inhibitory 
concentration (IC 50 ) and the actual % neutralization 
at each antibody dilution were then calculated for 

20 each virus. The results of all luciferase 

experiments were confirmed by direct counting of 
blue foci in parallel infections. Fig. 24A. 
Neutralization of Pseudovirions containing Subtype B 
consensus Env (gpl60) . Fig. 24B. Neutralization of 

25 Pseudovirions containing NL4 . 3 Env (gpl60) . 

Fig. 24C. Neutralization of Pseudovirions containing 
Subtype B consensus Env (gp!60) . Fig. 24D. 
Neutralization of Pseudovirions containing NL4 . 3 Env 
(gpl60) . 
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Figures 25A and 25B. Fig. 25A. Density and p24 
analysis of sucrose gradient fractions. 0.5ml 
fractions were collected from a 20-60% sucrose 
gradient. Fraction number 1 represents the most 
5 dense fraction taken from the bottom of the gradient 
tube. Density was measured with a ref ractometer and 
the amount of p24 in each fraction was determined by 
the Coulter p24 antigen assay. Fractions 6-9, 10- 
15, 16-21 , and 22-25 were pooled together and 

10 analyzed by Western Blot. As expected, virions 
sedimented at a density of 1.16-1.18 g/ml . 
Fig. 25B. VLP production by co- transf ect ion of 
subtype B consensus gag and env genes. 293T cells 
were co- transf ected with subtype B consensus gag and 

15 env genes. Cell supernatants were harvested 48- 
hours post-transf ection, clarified through at 20% 
sucrose cushion, and further purified through a 20- 
60% sucrose gradient. Select fractions from the 
gradient were pooled, added to 2 0ml of PBS, and 

20 centrifuged overnight at 100,000 x g. Resuspended 
pellets were loaded onto a 4-20% SDS-PAGE gel, 
proteins were transferred to a PVDF membrane, and 
probed with plasma from an HIV-1 subtype B infected 
individual . 

25 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an immunogen 
that induces antibodies that neutralize a wide 
spectrum of human immunodeficiency virus (HIV) 
primary isolates and/or that induces a T cell 
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response. The immunogen comprises at least one 
consensus or ancestral immunogen (e.g., Env, Gag, 
Nef or Pol), or portion or variant thereof. The 
invention also relates to nucleic acid sequences 
5 encoding the consensus or ancestral immunogen, or 
portion or variant thereof. The invention further 
relates to methods of using both the immunogen and 
the encoding sequences. While the invention is 
described in detail with reference to specific 

10 consensus and ancestral immunogens (for example, to 
a group M consensus Env) , it will be appreciated 
that the approach described herein can be used to 
generate a variety of consensus or ancestral 
immunogens (for example, envelopes for other HIV-1 

15 groups (e.g., N and O) ) . 

In accordance with one embodiment of the 
invention, a consensus env gene can be constructed 
by generating consensus sequences of env genes for 
each subtype of a particular HIV-1 group (group M 

20 being classified into subtypes A-D, F-H, J an K) , 
for example, from sequences in the Los Alamos HIV 
Sequence Database (using, for example, MASE 
(Multiple Aligned Sequence Editor) ) . A consensus 
sequence of all subtype consensuses can then be 

25 generated to avoid heavily sequenced subtypes 

(Gaschen et al , Science 296:2354-2360 (2002), Korber 
et al, Science 288:1789-1796 (2000)). In the case 
of the group M consensus env gene described in 
Example 1 (designated CON6) , five highly variable 

30 regions from a CRF08_BC recombinant strain (98CN006) 
(VI, V2, V4, V5 and a region in cytoplasmic domain 
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of gp41) are used to fill in the missing regions in 
the sequence (see, however, corresponding regions 
for CONs) . For high levels of expression, the 
codons of consensus or ancestral genes can be 
5 optimized based on codon usage for highly expressed 
human genes (Haas et al, Curr. Biol, 6:315-324 
(2000), Andre et al, J. Virol. 72:1497-1503 (1998)). 

It will be appreciated that the invention 
includes portions and variants of the sequences 
10 specifically disclosed herein. For example, and as 
regards the consensus and ancestral envelope 
sequences, the invention encompasses envelope 
sequences devoid of V3 . Alternatively, V3 sequences 
can be selected from preferred sequences, for 
15 example, those described in U.S. Application No. 
10/431,596 and U.S. Provisional Application 
No. 60/471,327. 

A consensus or ancestral envelope of the 
invention can be been "activated" to expose 
20 intermediate conformations of neutralization 

epitopes that normally are only transiently or less 
well exposed on the surface of the HIV virion. The 
immunogen can be a "frozen" triggered form of a 
consensus or ancestral envelope that makes available 
25 specific epitopes for presentation to B lymphocytes. 
The result of this epitope presentation is the 
production of antibodies that broadly neutralize 
HIV. (Attention is directed to WO 02/024149 and to 
the activated/triggered envelopes described 
30 therein. ) 



19 



766537 



The concept of a fusion intermediate immunogen 
is consistent with observations that the gp41 HR-2 
region peptide , DPI 78, can capture an uncoiled 
conformation of gp41 (Furata et al , Nature Struct. 
5 Biol. 5:276 (1998)), and that formalin- fixed HIV- 
infected cells can generate broadly neutralizing 
antibodies (LaCasse et al, Science 283:357 (1997)). 
Recently a monoclonal antibody against the coiled- 
coil region bound to a conformational determinant of 

10 gp41 in HR1 and HR2 regions of the coiled-coil gp41 
structure, but did not neutralize HIV (Jiang et al , 
J. Virol. 10213 (1998)). However, this latter study 
proved that the coiled-coil region is available for 
antibody to bind if the correct antibody is 

15 generated. 

The immunogen of one aspect of the invention 
comprises a consensus or ancestral envelope either 
in soluble form or anchored, for example, in cell 
vesicles or in liposomes containing translipid 

20 bilayer envelope. To make a more native envelope, 

gpl40 or gpl60 consensus or ancestral sequences can 
be configured in lipid bilayers for native trimeric 
envelope formation. Alternatively, triggered gpl60 
in aldrithio 1-2 inactivated HIV-1 virions can be 

2 5 used as an immunogen. The gp!60 can also exist as a 
recombinant protein either as gp!60 or gpl40 (gpl40 
is gpl60 with the transmembrane region and possibly 
other gp41 regions deleted) . Bound to gpl60 or 
gpl40 can be recombinant CCR5 or CXCR4 co-receptor 

30 proteins (or their extracellular domain peptide or 
protein fragments) or antibodies or other ligands 
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that bind to the CXCR4 or CCR5 binding site on 
gp!2 0 # and/or soluble CD4 , or antibodies or other 
ligands that mimic the binding actions of CD4 . 
Alternatively, vesicles or liposomes containing CD4 , 
5 CCR5 (or CXCR4) , or soluble CD4 and peptides 

reflective of CCR5 or CXCR4 gpl20 binding sites. 
Alternatively, an optimal CCR5 peptide ligand can be 
a peptide from the N-terminus of CCR5 wherein 
specific tyrosines are sulfated (Bormier et al , 

10 Proc. Natl. Acad. Sci . USA 97:5762 (2001)). The 
triggered immunogen may not need to be bound to a 
membrane but may exist and be triggered in solution. 
Alternatively, soluble CD4 (sCD4) can be replaced by 
an envelope (gpl40 or gpl60) triggered by CD4 

15 peptide mimetopes (Vitra et al, Proc. Natl. Acad. 
Sci. USA 96:1301 (1999)). Other HIV co-receptor 
molecules that "trigger" the gpl60 or gpl40 to 
undergo changes associated with a structure of gpl60 
that induces cell fusion can also be used. Ligation 

20 of soluble HIV gpl40 primary isolate HIV 89.6 
envelope with soluble CD4 (sCD4) induced 
conformational changes in gp41. 

In one embodiment, the invention relates to an 
immunogen that has the characteristics of a receptor 

25 (CD4) -ligated consensus or ancestral envelope with 
CCR5 binding region exposed but unlike CD4- ligated 
proteins that have the CD4 binding site blocked, 
this immunogen has the CD4 binding site exposed 
(open) . Moreover, this immunogen can be devoid of 

30 host CD4, which avoids the production of potentially 
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harmful anti-CD4 antibodies upon administration to a 
host . 

The immunogen can comprise consensus or 
ancestral envelope ligated with a ligand that binds 
5 to a site on gpl20 recognized by an A32 monoclonal 
antibodies (mab) (Wyatt et al, J. Virol. 69:5723 
(1995), Boots et al, AIDS Res. Hum. Retro. 13:1549 
(1997), Moore et al , J. Virol. 68:8350 (1994), 
Sullivan et al, J". Virol. 72:4694 (1998), Fouts et 
io al, J. Virol. 71:2779 (1997), Ye et al , J. Virol. 
74:11955 (2000)) . One A32 mab has been shown to 
mimic CD4 and when bound to gpl2 0, upregulates 
(exposes) the CCR5 binding site (Wyatt et al , J . 
Virol. 69:5723 (1995)). Ligation of gpl20 with such 
15 a ligand also upregulates the CD4 binding site and 
does not block CD4 binding to gpl20. 

Advantageously, such ligands also upregulate the HR- 
2 binding site of gp41 bound to cleaved gp!2 0, 
uncleaved gpl4 0 and cleaved gp41, thereby further 

20 exposing HR-2 binding sites on these proteins - each 
of which are potential targets for anti-HIV 
neutralizing antibodies . 

In a specific aspect of this embodiment, the 
immunogen comprises soluble HIV consensus or 

25 ancestral gpl2 0 envelope ligated with either an 

intact A3 2 mab, a Fab2 fragment of an A3 2 mab, or a 
Fab fragment of an A32 mab, with the result that the 
CD4 binding site, the CCR5 binding site and the HR-2 
binding site on the consensus or ancestral envelope 

3 0 are exposed/up regulated . The immunogen can comprise 
consensus or ancestral envelope with an A32 mab (or 
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fragment thereof) bound or can comprise consensus or 
ancestral envelope with an A32 mab (or fragment 
thereof) bound and cross-linked with a cross-linker 
such as .3% formaldehyde or a heterobif unctional 
5 cross -linker such as DTSSP (Pierce Chemical 

Company) . The immunogen can also comprise uncleaved 
consensus or ancestral gpl4 0 or a mixture of 
uncleaved gpl40 / cleaved gp41 and cleaved gpl20. An 
A3 2 mab (or fragment thereof) bound to consensus or 
10 ancestral gpl40 and/or gpl20 or to gpl2 0 non- 

covalently bound to gp41, results in upregulation 
(exposure) of HR-2 binding sites in gp41, gpl20 and 
uncleaved gpl40 . Binding of an A32 mab (or fragment 
thereof) to gpl20 or gpl40 also results in 
15 upregulation of the CD4 binding site and the CCR5 
binding site. As with gpl20 containing complexes, 
complexes comprising uncleaved gpl4 0 and an A32 mab 
(or fragment thereof) can be used as an immunogen 
uncross -linked or cross-linked with cross-linker 
20 such as .3% formaldehyde or DTSSP. In one 

embodiment, the invention relates to an immunogen 
comprising soluble uncleaved consensus or ancestral 
gpl40 bound and cross linked to a Fab fragment of an 
A3 2 mab, optionally bound and cross- linked to an HR- 
25 2 binding protein. 

The consensus or ancestral envelope protein 
triggered with a ligand that binds to the A32 mab 
binding site on gpl20 can be administered in 
combination with at least a second immunogen 
30 comprising a second envelope, triggered by a ligand 
that binds to a site distinct from the A32 mab 
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binding site, such as the CCR5 binding site 
recognized by mab 17b. The 17b mab (Kwong et al, 
Nature 393:648 (1998) available from the AIDS 
Reference Repository, NIAID, NIH) augments sCD4 
5 binding to gpl20. This second immunogen (which can 
also be used alone or in combination with triggered 
immunogens other than that described above) can, for 
example, comprise soluble HIV consensus or ancestral 
envelope ligated with either the whole 17b mab, a 

10 Fab2 fragment of the 17b mab, or a Fab fragment of 

the 17b mab. It will be appreciated that other CCR5 
ligands, including other antibodies (or fragments 
thereof) , that result in the CD4 binding site being 
exposed can be used in lieu of the 17b mab. This 

15 further immunogen can comprise gpl2 0 with the 17b 
mab, or fragment thereof, (or other CCR5 ligand as 
indicated above) bound or can comprise gpl2 0 with 
the 17b mab, or fragment thereof, (or other CCR5 
ligand as indicated above) bound and cross -linked 

20 with an agent such as .3% formaldehyde or a 

heterobi functional cross- linker , such as DTSSP 
(Pierce Chemical Company) . Alternatively, this 
further immunogen can comprise uncleaved gpl4 0 
present alone or in a mixture of cleaved gp41 and 

25 cleaved gpl20. Mab 17b, or fragment thereof (or 

other CCR5 ligand as indicated above) bound to gpl4 0 
and/or gpl2 0 in such a mixture results in exposure 
of the CD 4 binding region. The 17b mab, or fragment 
thereof, (or other CCR5 ligand as indicated above) 

30 gpl4 0 complexes can be present uncross -linked or 
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cross-linked with an agent such as .3% formaldehyde 
or DTSSP. 

Soluble HR-2 peptides, such as T649Q26L and 
DP178, can be added to the above -described complexes 
5 to stabilize epitopes on consensus gpl20 and gp41 as 
well as uncleaved consensus gpl40 molecules, and can 
be administered either cross-linked or uncross- 
linked with the complex. 

A series of monoclonal antibodies (mabs) have 

10 been made that neutralize many HIV primary isolates, 
including, in addition to the 17b mab described 
above, mab IgGlbl2 that binds to the CD4 binding 
site on gpl20 (Roben et al , J. Virol. 68:482 (1994), 
Mo et al, J*. Virol. 71:6869 (1997)), mab 2G12 that 

15 binds to a conformational determinant on gpl2 0 

(Trkola et al , J. Virol. 70:1100 (1996)), and mab 
2F5 that binds to a membrane proximal region of gp41 
(Muster et al, J. Virol. 68:4031 (1994)). 

As indicated above, various approaches can be 

20 used to "freeze" fusogenic epitopes in accordance 

with the invention. For example, "freezing" can be 
effected by addition of the DP-178 or T-649Q26L 
peptides that represent portions of the coiled coil 
region, and that when added to CD4 -triggered 

25 consensus or ancestral envelope, result in 

prevention of fusion (Rimsky et al, J*. Virol. 
72:986-993 (1998)). HR-2 peptide bound consensus or 
ancestral gpl20, gpl40, gp41 or gpl60 can be used as 
an immunogen or crosslinked by a reagent such as 

30 DTSSP or DSP (Pierce Co.), formaldehyde or other 
crosslinking agent that has a similar effect. 
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"Freezing" can also be effected by the addition 
of 0.1% to 3% formaldehyde or paraformaldehyde, both 
protein cross- linking agents, to the complex, to 
stabilize the CD4 , CCR5 or CXCR4 , HR-2 peptide gpl60 
5 complex, or to stabilize the "triggered" gp41 

molecule, or both (LaCasse et al, Science 283:357- 
362 (1999) ) . 

Further, "freezing" of consensus or ancestral 
gp41 or gpl2 0 fusion intermediates can be effected 

10 by addition of heterobif unct ional agents such as DSP 
(dithiobis [succimidylproprionate] ) (Pierce Co. 
Rockford, ILL., No. 22585ZZ) or the water soluble 
DTSSP (Pierce Co.) that use two NHS esters that are 
reactive with amino groups to cross link and 

15 stabilize the CD4 , CCR5 or CXCR4 , HR-2 peptide gpl60 
complex, or to stabilize the "triggered" gp41 
molecule, or both. 

Analysis of T cell immune responses in 
immunized or vaccinated animals and humans shows 

20 that the envelope protein is normally not a main 

target for T cell immune response although it is the 
only gene that induces neutralizing antibodies. 
HIV-1 Gag, Pol and Nef proteins induce a potent T 
cell immune response. Accordingly, the invention 

25 includes a repertoire of consensus or ancestral 

immunogens that can induce both humoral and cellular 
immune responses. Subunits of consensus or 
ancestral sequences can be used as T or B cell 
immunogens . 

3 0 The immunogen of the invention can be 

formulated with a pharmaceutical ly acceptable 
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carrier and/or adjuvant (such as alum) using 
techniques well known in the art. Suitable routes 
of administration of the present immunogen include 
systemic (e.g. intramuscular or subcutaneous). 
5 Alternative routes can be used when an immune 

response is sought in a mucosal immune system (e.g., 
intranasal) . 

The immunogens of the invention can be 
chemically synthesized and purified using methods 

10 which are well known to the ordinarily skilled 

artisan. The immunogens can also be synthesized by 
well-known recombinant DNA techniques. Nucleic 
acids encoding the immunogens of the invention can 
be used as components of, for example, a DNA vaccine 

is wherein the encoding sequence is administered as 

naked DNA or, for example, a minigene encoding the 
immunogen can be present in a viral vector. The 
encoding sequence can be present, for example, in a 
replicating or non-replicating adenoviral vector, an 

20 adeno-associated virus vector, an attenuated 
mycobacterium tuberculosis vector, a Bacillus 
Calmette Guerin (BCG) vector, a vaccinia or Modified 
Vaccinia Ankara (MVA) vector, another pox virus 
vector, recombinant polio and other enteric virus 

25 vector, Salmonella species bacterial vector. 

Shigella species bacterial vector, Venezuelean 
Equine Encephalitis Virus (VEE) vector, a Semliki 
Forest Virus vector, or a Tobacco Mosaic Virus 
vector. The encoding sequence, can also be 

30 expressed as a DNA plasmid with, for example, an 

active promoter such as a CMV promoter. Other live 
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vectors can also be used to express the sequences of 
the invention. Expression of the immunogen of the 
invention can be induced in a patient's own cells, 
by introduction into those cells of nucleic acids 
5 that encode the immunogen, preferably using codons 
and promoters that optimize expression in human 
cells. Examples of methods of making and using DNA 
vaccines are disclosed in U.S. Pat. Nos . 5,580,859, 
5,589,466, and 5,703,055. 1 

10 The composition of the invention comprises an 

immunologically effective amount of the immunogen of 
this invention, or nucleic acid sequence encoding 
same, in a pharmaceutical ly acceptable delivery 
system. The compositions can be used for prevention 

15 and/or treatment of immunodeficiency virus 

infection. The compositions of the invention can be 
formulated using adjuvants, emulsifiers, 
pharmaceutically-acceptable carriers or other 
ingredients routinely provided in vaccine 

20 compositions. Optimum formulations can be readily 

designed by one of ordinary skill in the art and can 
include formulations for immediate release and/or 
for sustained release, and for induction of systemic 
immunity and/or induction of localized mucosal 

25 immunity (e.g, the formulation can be designed for 
intranasal administration) . The present 
compositions can be administered by any convenient 
route including subcutaneous, intranasal, oral, 
intramuscular, or other parenteral or enteral route. 

30 The immunogens can be administered as a single dose 
or multiple doses. Optimum immunization schedules 
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can be readily determined by the ordinarily skilled 
artisan and can vary with the patient, the 
composition and the effect sought . 

The invention contemplates the direct use of 
5 both the immunogen of the invention and/or nucleic 
acids encoding same and/or the immunogen expressed 
as minigenes in the vectors indicated above. For 
example, a minigene encoding the immunogen can be 
used as a prime and/or boost . 
10 Certain aspects of the invention can be 

described in greater detail in the non- limiting 
Examples that follows . 

EXAMPLE 1 

Artificial HIV-1 Group M Consensus Envelope 
15 EXPERIMENTAL DETAILS 

Expression of CON6 gpl20 and gpl40 proteins in 
recombinant: vaccinia, viruses (W) . To express and 
purify the secreted form of HIV-1 CON6 envelope 

20 proteins, CON6 gpl2 0 and gpl4 0CF plasmids were 

constructed by introducing stop codons after the 
gpl2 0 cleavage site (REKR) and before the 
transmembrane domain ( YIKIFIMIVGGLIGLRIVFAVLSIVN) , 
respectively. The gpl20/gp41 cleavage site and 

25 fusion domain of gp41 were deleted in the gpl4 0CF 

protein. Both CON6 gpl20 and gpl40CF DNA constructs 
were cloned into the pSC63- vector (from Bernard 
Moss, NIH, Bethesda, MD) at Sail and Kpnl 
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restriction enzyme sites. This vector contains the 
lacZ gene that is controlled by the p7 . 5 promoter. 
A back-to-back P E/L promoter was used to express 
CONG env genes. BSC-1 cells were seeded at 2 x 10 s 
5 in each well in a 6-well plate, infected with wild- 
type vaccinia virus (WR) at a MOI of 0.1 pfu/cell, 
and 2 hr after infection, pSC65-derived plasmids 
containing CON6 env genes were transfected into the 
W-infected cells and recombinant (r) W selected as 

10 described (Moss and Earl, Current Protocols in 

Molecular Biology, eds, Ausubel et al (John Wiley & 
Sons, Inc. Indianapolis, IN) pp. 16.15.1-16.19.9 
(1998)). Recombinant W that contained the CON6 env 
genes were confirmed by PCR and sequencing analysis. 

15 Expression of the CON6 envelope proteins was 
confirmed by SDS-PAGE and Western blot assay. 
Recombinant CON6 gpl2 0 and gpl4 0CF were purified 
with agarose galanthus Nivalis lectin beads (Vector 
Labs, Burlingame, CA) , and stored at -70°C until use. 

20 Recombinant W expressing JRFL (vCB-28) or 96ZM651 
(vT241R) gpl60 were obtained from the NIH AIDS 
Research and Reference Reagent Program (Bethesda, 
MD) . 

25 Monoclonal Antibodies and gpl20 Wild- type 

Envelopes. Human mabs against a conformational 
determinant on gpl20 (A32) , the gpl20 V3 loop (F39F) 
and the CCR5 binding site (17b) were the gifts of 
James Robinson (Tulane Medical School, New Orleans, 

30 LA) (Wyatt et al , Nature 393/705-711 (1998), Wyatt 
et al, J . Virol. 69:5723-5733 (1995)). Mabs 2F5, 
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447 , bl2, 2G12 and soluable CD4 were obtained from 
the NIH AIDS Research and Reference Reagent Program 
(Bethesda, MD) (Gorny e tal / J. Immunol. 159:5114- 
5122 (1997), Nyambi et al , J. Virol. 70:6235-6243 
5 (1996), Purtscher et al , AIDS Res. Hum. Retroviruses 
10:1651-1658 (1994), Trkola et al, J . Virol 70:1100- 
1108 (1996) ) . T8 is a murine mab that maps to the 
gpl20 CI region (a gift from P. Earl, NIH, Bethesda, 
MD) . BaL (subtype B) , 96ZM651 (subtype C) , and 
10 93TH975 (subtype E) gpl20s were provided by QBI , 

Inc. and the Division of AIDS, NIH. CHO cell lines 
that express 92U037 (subtype A) and 93BR029 (subtype 
F) gpl4 0 (secreted and uncleaved) were obtained from 
NICBS, England. 

15 

Su-r^ace Plasmon Resonance Biosensor (SPR) 
Measurements and EJLISA. SPR biosensor measurements 
were determined on a BIAcore 3 000 instrument 
(BIAcore Inc., Uppsala, Sweden) instrument and data 

20 analysis was performed using BIAevaluation 3.0 

software (BIAcore Inc, Upsaala, Sweden) . Anti-gpl20 
mabs (T8, A32, 17b, 2G12) or sCD4 in lOmM Na-acetate 
buffer, pH 4.5 were directly immobilized to a CM 5 
sensor chip using a standard amine coupling protocol 

25 for protein immobilization. FPLC purified CON6 
gpl20 monomer or gpl40CF oligomer recombinant 
proteins were flowed over CM 5 sensor chips at 
concentrations of 100 and 300 /zg/ml , respectively. 
A blank in-line reference surface (activated and de- 

30 activated for amine coupling) or non-bonding mab 

controls were used to subtract non-specific or bulk 
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responses . Soluble 89.6 gpl20 and irrelevant IgG 
was used as a positive and negative control 
respectively and to ensure activity of each mab 
surface prior to injecting the CON6 Env proteins. 
5 Binding of CON6 envelope proteins was monitored in 
real-time at 25°C with a continuous flow of PBS (150 
mM NaCl, 0.005% surfactant P20), pH 7.4 at 10-30 
/xl/min. Bound proteins were removed and the sensor 
surfaces were regenerated following each cycle of 
10 binding by single or duplicate 5-10 |il pulses of 

regeneration solution (10 mM glycine-HCl, pH 2.9). 
ELISA was performed to determine the reactivity of 
various mabs to CON6 gpl2 0 and gpl4 0CF proteins as 
described (Haynes et al , AIDS Res. Hum. Retroviruses 
15 11:211-221 (1995)). For assay of human mab binding 
to rgpl20 or gpl40 proteins, end-point titers were 
defined as the highest titer of mab (beginning at 20 
/ig/ml) at which the mab bound CON6 gpl2 0 and gpl40CF 
Env proteins 3 fold over background control (non- 
20 binding human mab) . 

Infect! vi ty and coreceptor uaage aaaaya. HIV- 
l/SG3Aenv and CON6 or control env plasmids were 
cotransf ected into human 293T cells. Pseudotyped 

25 viruses were harvested, filtered and p24 

concentration was quantitated (DuPont/NEN Life 
Sciences, Boston, MA) . Equal amounts of p24 (5 ng) 
for each pseudovirion were used to infect JC53-BIj 
cells to determine the infectivity (Derdeyn e al, J . 

30 Virol. 74:8358-8367 (2000), Wei et al, Antimicrob 
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Agents Chemother. 46:1896-1905 (2002)). JC53-BL. 
cells express CD4 , CCR5 and CXCR4 receptors and 
contain a p-galactosidase (P-gal) gene stably 
integrated under the transcriptional control of an 
HIV-1 long terminal repeat (LTR) . These cells can 
be used to quantify the infectious titers of 
pseudovirion stocks by staining for P-gal expression 
and counting the number of blue cells (infectious 
units) per microgram of p24 of pseudovirons (IU/jzg 
p24) (Derdeyn e al, J. Virol. 74:8358-8367 (2000), 
Wei et al, Antimicrob Agents Chemother. 46:18 96-1905 
(2002)). To determine the coreceptor usage of the 
CON6 env gene, JC53BL cells were treated with 1.2 /zM 
AMD3100 and 4 /xM TAK-7 99 for 1 hr at 3 7°C then 
infected with equal amounts of p24 (5 ng) of each 
Env pseudotyped virus. The blockage efficiency was 
expressed as the percentage of the infectious units 
from blockage experiments compared to that from 
control culture without blocking agents. The 
infectivity from control group (no blocking agent) 
was arbitrarily set as 100%. 

Immunizations . All animals were housed in the 
Duke University Animal Facility under AALAC 
guidelines with animal use protocols approved by the 
Duke University Animal Use and Care Committee. 
Recombinant CON6 gpl2 0 and gpl40CF glycoproteins 
were formulated in a stable emulsion with RIBI-CWS 
adjuvant based on the protocol provided by the 
manufacturer (Sigma Chemical Co., St. Louis, MO). 
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For induction of ant i -envelope antibodies, each of 
four out-bred guinea pigs (Harlan Sprague, Inc., 
Chicago, IL) was given 10 0 ^g either purified CON6 
gpl20 or gpl40CF subcutaneously every 3 weeks (total 
5 of 5 immunizations) . Serum samples were heat- 

inactivated (56°C, 1 hr) , and stored at -20°C until 
use . 

For induction of anti-envelope T cell 
responses, 6-8 wk old female B ALB / c mice (Frederick 

10 Cancer Research and Developmental Center, NCI, 

Frederick, MD) were immunized i.m. in the quadriceps 
with 50 /ig plasmid DNA three times at a 3 -week 
interval . Three weeks after the last DNA 
immunization, mice were boosted with 10 7 PFU of rW 

15 expressing Env proteins. Two weeks after the boost, 
all mice were euthanized and spleens were removed 
for isolation of splenocytes . 

Neutralization assays. Neutralization assays 
20 were performed using either a MT-2 assay as 
described in Bures et al, AIDS Res. Hum. 
Retroviruses 16:2019-2035 (2000), a lucif erase-based 
multiple replication cycle HIV-1 infectivity assay 
in 5.25. GFP . Luc . M7 cells using a panel of HIV-1 
25 primary isolates (Bures et al , AIDS Res. Hum. 

Retroviruses 16:2019-2035 (2000), Bures et al, J". 
Virol. 76:2233-2244 (2002) ) # or a syncytium (fusion 
from without) inhibition assay using inactivated 
HIV-1 virions (Rossio et al, J. Virol. 72:7992-8001 
30 (1998) ) . In the lucif erase-based assay, 



34 



766537 



neutralizing antibodies were measured as a function 
of a reduction in luciferase acitivity in 
5.2 5.EGFP.L.UC.M7 cells provided by Nathaniel R. 
Landau, Salk Institute, La Jolla, CA (Brandt et al, 
J. Biol. Chem. 277:17291-17299 (2002)). Five 
hundred tissue culture infectious dose 50 (TCID 50 ) of 
cell -free virus was incubated with indicated serum 
dilutions in 150 jal (1 hr, at 37°C) in triplicate in 
96-well flat-bottom culture plates. The 
5.25.EGFP.Luc.M7 cells were suspended at a density 
of 5 x 10 5 /ml in media containing DEAE dextran (10 
Hg/ml) . Cells (100 jal) were added and until 10% of 
cells in control wells (no test serum sample) were 
positive for GFP expression by fluorescence 
microscopy. At this time the cells were 
concentrated 2- fold by removing one -half volume of 
media. A 50 jj.1 suspension of cells was transferred 
to 96-well white solid plates (Costar, Cambridge, 
MA) for measurement of luciferase activity using 
Bright-Glo™ substrate (Promega, Madison, WI) on a 
Wallac 1420 Multilabel Counter (PerkinElmer Life 
Sciences, Boston, MA) . Neutralization titers in the 
MT-2 and luciferase assays were those where > 50% 
virus infection was inhibited. Only values that 
titered beyond 1:20 (i.e. >1:30) were considered 
significantly positive. The syncytium inhibition 
"fusion from without" assay utilized HIV-1 
aldrithiol-2 (AT-2) inactivated virions from HIV-1 
subtype B strains ADA and AD8 (the gift of Larry 
Arthur and Jeffrey Lifson, Frederick Research Cancer 



Facility, Frederick, MD) added to SupTl cells, with 
syncytium inhibition titers determined as those 
titers where >90% of syncytia were inhibited 
compared to prebleed sera. 

5 

Enzyme linked immune spot (ELISPOT) assay. 
Single-cell suspensions of splenocytes from 
individual immunized mice were prepared by mincing 
and forcing through a 70 Jim Nylon cell strainer (BD 

10 Labware, Franklin Lakes, NJ) . Overlapping Env 
peptides of CON6 gp!4 0 (15 9 peptides, 15mers 
overlapping by 11) were purchased from Boston 
Bioscence, Inc (Royal Oak, MI) . Overlapping Env 
peptides of MN gpl4 0 (subtype B; 170 peptides, 

15 ISmers overlapping by 11) and Chnl9 gpl40 (subtype 
C; 69 peptides, 20mers overlapping by 10) were 
obtained from the NIH AIDS Research and Reference 
Reagent Program (Bethesda, MD) . Splenocytes (5 
mice/group) from each mouse were stimulated in vitro 

2 0 with overlapping Env peptides pools from CON6, 

subtype B and subtype C Env proteins. 96-well PVDF 
plates (Multiscreen-IP, Millipore, Billerica, MA) 
were coated with anti-IFN-y mab (5 ^g/ml , AN18; 
Mabtech, Stockholm, Sweden) . After the plates were 

25 blocked at 37 C for 2 hr using complete Hepes 

buffered RPMI medium, 50^il of the pooled overlapping 
envelope peptides (13 CON6 and MN pools, 13-14 
peptides in each pool; 9 Chnl9 pool, 7-8 peptide in 
each pool) at a final concentration of 5 jig/ml of 

30 each were added to the plate. Then 50 pel of 
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splenocytes at a concentration of 1.0 X 10 7 /ml were 
added to the wells in duplicate and incubated for 16 
hr at 37 C with 5% C0 2 . The plates were incubated 
with 100 Ail of a 1:1000 dilution of streptavidin 
5 alkaline phosphatase (Mabtech, Stockholm, Sweden) , 
and purple spots developed using 10 0 fil of BCIP/NBT 
(Plus) Alkaline Phosphatase Substrate (Moss, 
Pasadena, MD) . Spot forming cells (SFC) were 
measured using an Immunospot counting system (CTL 
10 Analyzers, Cleveland, OH) . Total responses for each 
envelope peptide pool are expressed as SFCs per 10 6 
splenocytes . 

RESULTS 

15 

CON6 Envelope Gene Design, Construction and 
Expression. An artificial group M consensus env 
gene (CON6) was constructed by generating consensus 
sequences of env genes for each HIV-1 subtype from 

20 sequences in the Los Alamos HIV Sequence Database, 
and then generating a consensus sequence of all 
subtype consensuses to avoid heavily sequenced 
subtypes (Gaschen et al , Science 296:2354-2360 
(2002), Korber et al, Science 288:1789-1796 (2000)), 

25 Five highly variable regions from a CRF08_ BC 

recombinant strain (98CN006) (VI, V2 , V4 , V5 and a 
region in cytoplasmic domain of gp41) were then used 
to fill in the missing regions in CON6 sequence . 
The CON6 V3 region is group M consensus (Figure 1A) . 

30 For high levels of expression, the codons of CON6 
env gene were optimized based on codon usage for 
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highly expressed human genes (Haas et al , Curr. 
Biol. 6:315-324 (2000), Andre et al, J. Virol. 
72:1497-1503 (1998)). (See Fig. ID.) The codon 
optimized CON6 env gene was constructed and 
5 subcloned into pcDNA3 . 1 DNA at EcoR I and BamH I 
sites (Gao et al, AIDS Res. Hum. Retroviruses, 
19:817-823 (2003)). High levels of protein 
expression were confirmed with Western-blot assays 
after transfection into 293T cells. To obtain 

10 recombinant CON6 Env proteins for characterization 
and use as immunogens , rW was generated to express 
secreted gpl20 and uncleaved gpl40CF (Figure IB) . 
Purity for each protein was >90% as determined by 
Coomassie blue gels under reducing conditions 

15 (Figure 1C) . 

CD4 Binding Domain and Other Wild- type HIV-1 
Epitopes are Preserved on CON6 Proteins. To 
determine if CON6 proteins can bind to CD4 and 

20 express other wild-type HIV-1 epitopes, the ability 
of CON6 gpl20 and gpl40CF to bind soluble (s) CD4 , to 
bind several well -characterized anti-gpl2 0 mabs, and 
to undergo CD4- induced conformational changes was 
assayed. First, BIAcore CM5 sensor chips were 

25 coated with either sCD4 or mabs to monitor their 

binding activity to CON6 Env proteins. It was found 
that both monomer ic CON6 gpl2 0 and oligomer ic 
gpl4 0CF efficiently bound sCD4 and anti-gpl2 0 mabs 
T8, 2G12 and A32, but did not const itutively bind 

30 mab 17b, that recognizes a CD4 inducible epitope in 
the CCR5 binding site of gp!2 0 (Figures 2A and 2B) . 
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Both sCD4 and A3 2 can expose the 17b binding epitope 
after binding to wild-type gp!20 (Wyatt et al , 
Nature 393;705-711 (1998), Wyatt et al , J. Virol. 
69:5723-5733 (1995)). To determine if the 17b 
5 epitope could be induced on CON6 Envs by either sCD4 
or A32, sCD4, A32 and T8 were coated on sensor 
chips, then CON6 gpl20 or gpl40CF captured, and mab 
17b binding activity monitored. After binding sCD4 
or mab A32, both CON6 gpl20 and gpl40CF were 

io triggered to undergo conformational changes and 
bound mab 17b (Figures 2C and 2D) . In contrast, 
after binding mab T8 , the 17b epitope was not 
exposed (Figures 2C and 2D) . EL ISA was next used to 
determine the reactivity of a panel of human mabs 

15 against the gpl20 V3 loop (447, F39F) , the CD4 
binding site (bl2), and the gp41 neutralizing 
determinant (2F5) to CON6 gpl20 and gpl40CF (Figure 
2E) . Both CON6 rgpl2 0 and rgpl4 0CF proteins bound 
well to neutralizing V3 mabs 447 and F39F and to the 

20 potent neutralizing CD4 binding site mab bl2 . Mab 
2F5, that neutralizes HIV-1 primary isolates by 
binding to a C- terminal gp41 epitope, also bound 
well to CON6 gpl4 0CF (Figure 2E) . 

25 CON6 env Gene is Biologically Functional and 

Uses CCR5 as its Coreceptor. To determine whether 
CON6 envelope gene is biologically functional, it 
was co- transf ected with the env- de feet i ve SG3 
proviral clone into 293T cells. The pseudotyped 

30 viruses were harvested and JC53BIj cells infected. 
Blue cells were detected in *JC53-BLi cells infected 
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with the CON6 Env pseudovirions, suggesting that 
CON6 Env protein is biologically functional (Figure 
3A) . However, the infectious titers were 1-2 logs 
lower than that of pseudovirions with either YU2 or 
5 NL4-3 wild- type HIV-1 envelopes. 

The co- receptor usage for the CON6 env gene was 
next determined. When treated with CXCR4 blocking 
agent AMD3100, the infectivity of NL.4-3 Env- 
pseudovirons was blocked while the infectivity of 

10 YU2 or CON6 Env -pseudo virons was not inhibited 

(Figure 3B) . In contrast, when treated with CCR5 
blocking agent TAK-779, the infectivity of NL4-3 
Env-pseudovirons was not affected, while the 
infectivity of YU2 or CON6 Env-pseudovirons was 

is inhibited. When treated with both blocking agents, 
the infectivity of all pseudovirions was inhibited. 
Taken together, these data show that the CON6 
envelope uses the CCR5 co- receptor for its entry 
into target cells. 

20 

Reaction of CONG gpl20 With Different Subtype 
Sera. To determine if multiple subtype linear 
epitopes are preserved on CON6 gpl20, a recombinant 
Env protein panel (gpl2 0 and gpl4 0) was generated. 

25 Equal amounts of each Env protein (100 ng) were 

loaded on SDS-polyacrylamide gels, transferred to 
nitrocellulose, and reacted with subtype A through G 
patient sera as well as anti-CON6 gpl20 guinea pig 
sera (1:1,000 dilution) in Western blot assays. For 

30 each HIV-1 subtype, four to six patient sera were 
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tested. One serum representative for each subtype 
is shown in Figure 4. 

It was found that whereas all subtype sera 
tested showed variable reactivities among Envs in 
s the panel , all group M subtype patient sera reacted 
equally well with CON6 gpl20 Env protein, 
demonstrating that wild- type HIV-1 Env epitopes 
recognized by patient sera were well preserved on 
the CON6 Env protein. A test was next made as to 
10 whether CON6 gpl2 0 antiserum raised in guinea pigs 
could react to different subtype Env proteins. It 
was found that the CON6 serum reacted to its own and 
other subtype Env proteins equally well, with the 
exception of subtype A Env protein (Figure 4) . 

15 

Induction of T Cell Responses to CON6 , Subtype 
B and Subtype C Envelope Overlapping Peptides. To 
compare T cell immune responses induced by CON 6 Env 
immunogens with those induced by subtype specific 

20 immunogens, two additional groups of mice were 

immunized with subtype B or subtype C DNAs and with 
corresponding rW expressing subtype B or C envelope 
proteins. Mice immunized with subtype B (JRFL) or 
subtype C (96ZM651) Env immunogen had primarily 

25 subtype- specific T cell immune responses (Figure 5) . 
IFN-y SFCs from mice immunized with JRFL (subtype B) 
immunogen were detected after stimulation with 
subtype B (MN) peptide pools, but not with either 
subtype C (Chnl9) or CONS peptide pools. IFN-y SFCs 

30 from mice immunized with 96ZM651 (subtype C) 
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immunogen were detected after the stimulation with 
both subtype C (Chnl9) and CON6 peptide pools, but 
not with subtype B (MN) peptide pools- In contrast, 
IFN-y SFCs were identified from mice immunized with 
5 CON6 Env immunogen s when stimulated with either CON6 
peptide pools as well as by subtype B or C peptide 
pools (Figure 5) . The T cell immune responses 
induced by CON6 gpl40 appeared more robust than 
those induced by CONG gpl20. Taken together, these 
10 data demonstrated that CON6 gpl20 and gpl40CF 

immunogens were capable of inducing T cell responses 
that recognized T cell epitopes of wild- type subtype 
B and C envelopes. 

15 Induction of Antibodies by Recombinant CONS 

gp!20 and gpl4 0CF Envelopes that Neutralize HIV-1 
Subtype B and C Primary Isolates . To determine if 
the CON6 envelope immunogens can induce antibodies 
that neutralize HIV-1 primary isolates, guinea pigs 

20 were immunized with either CON6 gpl2 0 or gpl4 0CF 

protein. Sera collected after 4 or 5 immunizations 
were used for neutralization assays and compared to 
the corresponding prebleed sera. Two AT- 2 
inactivated HIV-1 isolates (ADA and AD8) were tested 

25 in syncytium inhibition assays (Table 1A) . Two 
subtype B SHIV isolates, eight subtype B primary 
isolates, four subtype C, and one each subtype A, D, 
and E primary isolates were tested in either the MT- 
2 or the lucif erase-based assay (Table IB) . In the 

30 syncytium inhibition assay, it was found that 
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antibodies induced by both CON 6 gpl2 0 and gpl4 0CF 
proteins strongly inhibited AT-2 inactivated ADA and 
AD8-induced syncytia (Table 1A) . In the MT-2 assay, 
weak neutralization of 1 of 2 SHIV isolates (SHIV 
SF162P3) by two gpl20 and one gpl40CF sera was found 

(Table IB) . In the lucif erase-based assay, strong 
neutralization of 4 of 8 subtype B primary isolates 

(BX08, SF162, SS1196, and BAL) by all gpl20 and 
gpl40CF sera was found, and weak neutralization of 2 
of 8 subtype B isolates (6101, 0692) by most gpl20 
and gpl40CF sera was found. No neutralization was 
detected against HIV-1 PAVO (Table IB) . Next, the 
CON6 anti-gpl2 0 and gpl4 0CF sera were tested against 
four subtype C HIV-1 isolates, and weak 

neutralization of 3 of 4 isolates (DU179, DU368, and 
S080) was found, primarily by anti-CON6 gpl20 sera. 
One gpl40CF serum, no. 653, strongly neutralized 
DU179 and weakly neutralized S080 (Table IB) . 
Finally, anti-CON6 Env sera strongly neutralized a 
subtype D isolate (93ZR001) , weakly neutralized a 
subtype E (CM244) isolate, and did not neutralize a 
subtype A (92RW020) isolate. 
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Table 1A 



Ability of HIV-1 Group M Consensus Envel pe CON6 Pr t ins to Induce 
Fusion Inhibiting Antibodies m 







Syncytium Inhibition antibody titer 1 


Guinea Pig No. 


Immunogen 


AD8 


ADA 


646 


gpi20 


270 


270 


647 


gpi20 


90 


90 


648 


gpi20 


90 


270 


649 


gpi20 


90 


90 


Geometric Mean Titer 




119 


156 


650 


gpi40 


270 


270 


651 


gpi40 


90 


90 


652 


gp140 


810 


810 


653 


gpi40 


270 


90 


Geometric Mean Titer 




270 


207 



Reciprocal serum dilution at which HIV-induced syncytia of Sup T1 cells was 
inhibited by >90% compared to pre-immune serum. All prebleed sera were negative 
(titer <10). 

5 
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CONCLUSIONS 

The production of an artificial HIV-1 Group M 
consensus env gene (CON6) has been described that 
5 encodes a functional Env protein that is capable of 
utilizing the CCR5 co-receptor for mediating viral 
entry. Importantly, this Group M consensus envelope 
gene could induce T and B cell responses that 
recognized epitopes of subtype B and C HIV-1 primary 

10 isolates. 

The correlates of protection to HIV-1 are not 
conclusively known. Considerable data from animal 
models and studies in HIV- 1 - infected patients 
suggest the goal of HIV-1 vaccine development should 

15 be the induction of broadly-reactive CD4+ and CD8+ 
ant i -HIV-1 T cell responses (Letvin et al , Annu. 
Rev. Immunol. 20:73-99 (2002)) and high levels of 
antibodies that neutralize HIV-1 primary isolates of 
multiple subtypes (Mascola et al, J. Virol. 73:4009- 

20 4018 (1999), Mascola et al , Nat. Med. 6:270-210 
(2000) ) . 

The high level of genetic variability of HIV-1 
has made it difficult to design immunogens capable 
of inducing immune responses of sufficient breadth 

25 to be clinically useful. Epitope based vaccines for 
T and B cell responses (McMichael et al , Vaccine 
20:1918-1921 (2002), Sbai et al , Curr . Drug Targets 
Infect, Disord. 1:303-313 (2001), Haynes, Lancet 
348:933-937 (1996)), constrained envelopes 

30 reflective of fusion intermediates (Fouts et al , 

Proc. Natl. Acad. Sci . USA 99:11842-22847 (2002)), 
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as well as exposure of conserved high- order 
structures for induction of anti-HIV-1 neutralizing 
antibodies have been proposed to overcome HIV-1 
variability (Roben et al, J. Virol. 68:4821-4828 
5 (1994), Saphire et al, Science 293:1155-1159 
(2001) ) . However, with the ever- increasing 
diversity and rapid evolution of HIV-1, the virus is 
a rapidly moving complex target, and the extent of 
complexity of HIV-1 variation makes all of these 

10 approaches problematic. The current most common 
approach to HIV-1 immunogen design is to choose a 
wild- type field HIV-1 isolate that may or may not be 
from the region in which the vaccine is to be 
tested. Polyvalent envelope immunogens have been 

15 designed incorporating multiple envelope immunogens 
(Bartlett et al, AIDS 12:1291-1300 (1998), Cho et 
al, J . Virol. 75:2224-2234 (2001)). 

The above -described study tests a new strategy 
for HIV-1 immunogen design by generating a group M 

20 consensus env gene (CON6) with decreased genetic 

distance between this candidate immunogen and wild- 
type field virus strains. The CON6 env gene was 
generated for all subtypes by choosing the most 
common amino acids at most positions (Gaschen et al, 

25 Science 296:2354-2360 (2002), Korber et al , Science 
288:1789-1796 (2000)). Since only the most common 
amino acids were used, the majority of antibody and 
T cell epitopes were well preserved. Importantly, 
the genetic distances between the group M consensus 

30 env sequence and any subtype env sequences was about 
15%, which is only half of that between wild- type 
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subtypes (30%) (Gaschen et al , Science 2 96:2354-2360 
(2002) ) . This distance is approximately the same as 
that among viruses within the same subtype. 
Further, the group M consensus env gene was also 
5 about 15% divergent from any recombinant viral env 
gene, as well, since CRFs do not increase the 
overall genetic divergence among subtypes. 

Infectivity of CON6-Env pseudovirions was 
confirmed using a single-round infection system, 

10 although the infectivity was compromised, indicating 
the artificial envelope was not in an "optimal" 
functional conformation, but yet was able to mediate 
virus entry. That the CON6 envelope used CCR5 (R5) 
as its coreceptor is important, since majority of 

15 HIV-1 infected patients are initially infected with 
R5 viruses . 

BIAcore analysis showed that both CON6 gp!20 
and gpl4 0CF bound sCD4 and a number of mabs that 
bind to wild-type HIV-1 Env proteins. The 

20 expression of the CON6 gpl2 0 and 14 0CF proteins that 
are similar antigenically to wild-type HIV-1 
envelopes is an important step in HIV-1 immunogen 
development. However, many wild-type envelope 
proteins express the epitopes to which potent 

25 neutralizing human mabs bind, yet when used as 
immunogens themselves, do not induce broadly 
neutralizing ant i -HIV-1 antibodies of the 
specificity of the neutralizing human mabs. 
The neutralizing antibody studies were 

30 encouraging in that both CON6 gpl2 0 and gpl4 0CF 

induced antibodies that neutralized select subtype 
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B, C and D HIV-1 primary isolates. However, it is 
clear that the most difficult -to -neutralize primary 
isolates (PAVO, 6101, BG1168, 92RW020, CM244) were 
either only weakly or not neutralized by anti-CON6 
5 gpl20 or gpl40 sera (Table IB) . Nonetheless, the 
CON6 envelope immunogenicity for induction of 
neutralizing antibodies is promising, given the 
breadth of responses generated with a subunit 
envelope protein. Previous studies with poxvirus 

10 constructs expressing gpl20 and gpl60 have not 

generated high levels of neutralizing antibodies 
(Evans et al , J. Infect. Dis. 180:290-298 (1999), 
Polacino et al , J. Virol. 73:618-630 (1999), 
Ourmanov et al, J. Virol. 74:2960-2965 (2000), Pal 

15 et al, J". Virol 76:292-302 (2002), Excler and 

Plotkin, AIDS 11 (Suppl A):S127-137 (1997). rW 
expressing secreted CON6 gpl20 and gpl40 have been 
constructed and antibodies that neutralize HIV-1 
primary isolates induced. It will be important to 

20 determine the breadth of neutralizing antibodies 
induced by DNA or recombinant adenovirus prime 
followed by recombinant vaccinia virus boost, to 
determine if expression of the CON6 envelope in vivo 
induces a broader neutralizing response than protein 

25 immunizations . 

The structure of an oligomeric gpl40 protein is 
critical when evaluating protein immunogenicity. In 
this regard, study of purified CON6 gpl40CF proteins 
by fast performance liquid chromatography (FPLC) and 

30 analytical ul tracentrif iguration has demonstrated 
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that the purified gpl4 0 peak consists predominantly 
of t rimers with a small component of dimers . 

Thus, centralized envelopes such as CON6 are 
attractive candidates for preparation of various 
5 potentially "enhanced" envelope immunogens including 
CD4-Env complexes, constrained envelope structures, 
and trimeric oligomeric forms. The ability of CON6- 
induced T and B cell responses to protect against 
HIV-1 infection and/or disease in SHIV challenge 

10 models will be studied in non-human primates. 

The above study has demonstrated that 
artificial centralized HIV-1 genes such as group M 
consensus env gene (CON6) can also induce T cell 
responses to T cell epitopes in wild-type subtype B 

15 and C Env proteins as well as to those on group M 
consensus Env proteins (Figure 5) . While the DNA 
prime and rW boost regimen with CON6 gpl4 0CF 
immunogen clearly induced IFN-y producing T cells 
that recognized subtype B and C epitopes, further 

20 studies are needed to determine if centralized 

sequences such as are found in the CON6 envelope are 
significantly better at inducing cross-clade T cell 
responses than wild-type HIV-1 genes (Ferrari et al, 
Proc. Natl. Acad. Sci . USA 94:1396-1401 (1997), 

25 Ferrari et al , AIDS Res. Hum. Retroviruses 16:1433- 
1443 (2000) ) . However, the fact that CON6 prime and 
boosted splenocyte T cells recognized HIV-1 subtype 
B and C T cell epitopes is an important step in 
demonstration that CON6 can induce T cell responses 

30 that might be clinically useful. 
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Three computer models (consensus, ancestor and 
center of the tree (COT) ) have been proposed to 
generate centralized HIV-1 genes (Gaschen et al , 
Science 296:2354-2360 (2002), Gao et al , Science 
299:1517-1518 (2003), Nickle et al, Science 
299:1515-1517 (2003), Korber et al , Science 
288:1789-1796 (2000). They all tend to locate at 
the roots of the star-like phylogenetic trees for 
most HIV-1 sequences within or between subtypes. As 
experimental vaccines, they all can reduce the 
genetic distances between immunogens and field virus 
strains. However, consensus, ancestral and COT 
sequences each have advantages and disadvantages 
(Gaschen et al, Science 296:2354-2360 (2002), Gao et 
al, Science 299:1517-1518 (2003), Nickle et al , 
Science 299:1515-1517 (2003). Consensus and COT 
represent the sequences or epitopes in sampled 
current wild-type viruses and are less affected by 
outliers HIV-1 sequences, while ancestor represents 
ancestral sequences that can be significantly 
affected by outlier sequences. However, at present, 
it is not known which centralized sequence can serve 
as the best immunogen to elicit broad immune 
responses against diverse HIV-1 strains, and studies 
are in progress to test these different strategies. 

Taken together, the data have shown that the 
HIV-1 artificial CON6 envelope can induce T cell 
responses to wild-type HIV-1 epitopes, and can 
induce antibodies that neutralize HIV-1 primary 
isolates, thus demonstrating the feasibility and 
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promise of using artificial centralized HIV-1 
sequences in HIV-1 vaccine design. 

EXAMPLE 2 



5 HIV-1 Subtype C Ancestral and Consensus Envelope 

Glycoproteins 

EXPERIMENTAL DETAILS 

HIV-1 subtype C ancestral and consensus env 
genes were obtained from the Los Alamos HIV 

10 Molecular Immunology Database (http://hiv- 

web.lanl.gov/immunology), codon-usage optimized for 
mammalian cell expression, and synthesized (Fig, 6) . 
To ensure optimal expression, a Kozak sequence 
(GCCGCCGCC) was inserted immediately upstream of the 

15 initiation codon. In addition to the full-length 
genes, two truncated env genes were generated by 
introducing stop codons immediately after the gp41 
membrane -spanning domain (IVNR) and the gpl20/gp41 
cleavage site (REKR) , generating gpl40 and gpl20 

20 form of the glycoproteins, respectively (Fig. 8) . 

Genes were tested for integrity in an in vitro 
transcription/translation system and expressed in 
mammalian cells. To determine if the ancestral and 
consensus subtype C envelopes were capable of 

25 mediating fusion and entry, gpl60 and gpl40 genes 
were co- transf ected with an HIV-1/ SG3Aenv provirus 
and the resulting pseudovirions tested for 
infectivity using the JC53-BL cell assay (Fig. 7) . 
Co-receptor usage and envelope neutralization 

30 sensitivity were also determined with slight 
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modifications of the JC53-BL assay, Codon-usage 
optimized and rev-dependent 96ZAM651 env genes were 
used as contemporary subtype C controls. 



RESULTS 

5 

Codon-optimized subtype C ancestral and 
consensus envelope genes (gpl60, gpl40, gpl20) 
express high levels of env glycoprotein in mammalian 
cells (Fig. 9) . 

10 Codon-optimized subtype C gpl60 and gpl40 

glycoproteins are efficiently incorporated into 
virus particles. Western Blot analysis of sucrose- 
purified pseudovirions reveals ten-fold higher 
levels of virion incorporation of the codon- 

15 optimized envelopes compared to that of a rev- 
dependent contemporary envelope controls (Fig. 10A) . 

Virions pseudotyped with either the subtype C 
consensus gpl60 or gpl4 0 envelope were more 
infectious than pseudovirions containing the 

20 corresponding gpl60 and gpl40 ancestral envelopes. 

Additionally, gpl60 envelopes were consistently more 
infectious than their respective gpl40 counterparts 
(Fig. 10B) . 

Both subtype C ancestral and consensus 

25 envelopes utilize CCR5 as a co-receptor to mediate 
virus entry (Fig. 11) . 

The infect±vJLty of siofefcype C ancestral and 
consensus gpl60 containing pseudovirions was 
neutralized by plasma from subtype C infected 

30 patients. This suggests that these artificial 
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envelopes possess a structure that is similar to 
that of native HIV-1 env glycoproteins and that 
common neutralization epitopes are conserved. No 
significant differences in neutralization potential 
5 were noted between subtype C ancestral and consensus 
env glycoproteins (gpl60) (Fig. 12) . 

CONCLUSIONS 

HIV-1 subtype C viruses are among the most 
prevalent circulating isolates, representing 

10 approximately fifty percent of new infections 
worldwide. Genetic diversity among globally 
circulating HIV-1 strains poses a challenge for 
vaccine design. Although HIV-1 Env protein is highly 
variable, it can induce both humoral and cellular 

15 immune responses in the infected host . By analyzing 
7 0 HIV-1 complete subtype C env sequences, consensus 
and ancestral subtype C env genes have been 
generated. Both sequences are roughly equidistant 
from contemporary subtype C strains and thus 

20 expected to induce better cross-protective immunity. 
A reconstructed ancestral or consensus sequence 
derived- immunogen minimizes the extent of genetic 
differences between the vaccine candidate and 
contemporary isolates. However, consensus and 

25 ancestral subtype C env genes differ by 5% amino 
acid sequences. Both consensus and ancestral 
sequences have been synthesized for analyses. 
Codon- optimized subtype C ancestral and consensus 
envelope genes have been constructed and the in 
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vitro biological properties of the expressed 
glycoproteins determined. Synthetic subtype C 
consensus and ancestral env genes express 
glycoproteins that are similar in their structure, 
function and antigenicity to contemporary subtype C 
wild- type envelope glycoproteins. 



EXAMPLE 3 



Codon-Usage Optimization of Consensus of Subtype C 
gragr and nef Genes (C. con. gag and C.con.nef ) 



Subtype C viruses have become the most 
prevalent viruses among all subtypes of Group M 
viruses in the world. More than 50% of HIV-1 
infected people are currently carrying HIV-1 subtype 
C viruses. In addition, there is considerable 
intra- subtype C variability: different subtype C 
viruses can differ by as much as 10%, 6%, 17% and 
16% of their Gag, Pol, Env and Nef proteins, 
respectively. Most importantly, the subtype C 
viruses from one country can vary as much as the 
viruses isolated from other parts of the world. The 
only exceptions are HIV-1 strains from India/China, 
Brazil and Ethiopia/Djibouti where subtype C appears 
to have been introduced more recently. Due to the 
high genetic variability of subtype C viruses even 
within a single country, an immunogen based on a 
single virus isolate may not elicit protective 
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immunity against other isolates circulating in the 
same area . 

Thus gag and nef gene sequences of subtype C 
viruses were gathered to generate consensus 
5 sequences for both genes by using a 50% consensus 

threshold. To avoid a potential bias toward founder 
viruses, only one sequence was used from 
India/China, Brazil and Ethiopia/Djibouti, 
respectively, to generate the subtype C consensus 

10 sequences (C. con. gag and C. con. nef). The codons of 
both C. con. gag and C. con. nef genes were optimized 
based on the codon usage of highly expressed human 
genes. The protein expression following transfection 
into 293T cells is shown in Figure 13. As can be 

15 seen, both consensus subtype C Gag and Nef proteins 
were expressed efficiently and recognized by Gag- 
and Nef -specific antibodies. The protein expression 
levels of both C. con. gag and C. con. nef genes are 
comparible to that of native subtype env gene 

20 (96ZM651) . 



EXAMPLE 4 



Synthesis of a Full Length "Consensus of the 
25 Consensus env Gene with Consensus Variable Regions" 

(CONs) 

In the synthesized "consensus of the consensus" 
env gene (CON6) , the variable regions were replaced 



56 



766537 



with the corresponding regions from a contemporary 
subtype C virus (98CN006) . A further con/ con gene 
has been designed that also has consensus variable 
regions (CONs) . The codons of the CONs env gene were 
5 optimized based on the codon usage of highly 

expressed human genes. (See Figs. 14A and 14B for 
amino acid sequences and nucleic acid sequences, 
respectfully. ) 

Paired oligonucleotides (80-mers) which overlap 

10 by 20 bp at their 3' ends and contain invariant 
sequences at their 5' and 3' ends, including the 
restriction enzyme sites EcoRI and Bbsl as well as 
BsmBI and BamHI , respectively, were designed. Bbsl 
and BamHI are Type II restriction enzymes that 

15 cleave outside of their recognition sequences. They 
have been positioned in the oligomers in such a way 
that they cleave the first four resides adjacent to 
the 18 bp invariant region, leaving 4 base 5' 
overhangs at the end of each fragment for the 

20 following ligation step. 26 paired oligomers were 
linked individually using PCR and primers 
complimentary to the 18 bp invariant sequences. 
Each pair was cloned into pGEM-T (Promega) using the 
T/A cloning method and sequenced to confirm the 

25 absence of inadvertent mutations/deletions. pGEM-T 
subclones containing the proper inserts were then 
digested, run on a 1% agarose gel, and gel purified 
(Qiagen) . Four individual 108-mers were ligated 
into pcDNA3 . 1 (Invitrogen) in a multi-fragment 

30 ligation reaction. The four-way ligations occurred 
among groups of fragments in a stepwise manner from 
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the 5' to the 3' end of the gene. This process was 
repeated until the entire gene was reconstructed in 
the pcDNA3 . 1 vector . 

A complete CONs gene was constructed by 
ligating the codon usage optimized oligo pairs 
together. To confirm its open reading frame, an in 
vitro transcription and translation assay was 
performed. Protein products were labeled by S 35 - 
methionine during the translation step, separated on 
a 10% SDS-PAGE, and detected by radioautography . 
Expected size of the expressed CONS gpl6 0 was 
identified in 4 out of 7 clones (Fig. 14C) . 

CONs Env protein expression in the mammalian 
cells after transfected into 293T cells using a 
Western blot assay (Figure 15) . The expression level 
of CONs Env protein is very similar to what was 
observed from the previous CON6 env clone that 
contains the consensus conservative regions and 
variable loops from 98CN006 virus isolate. 

The Env-pseudovirons was produced by 
cotransf ecting CONs env clone and env-def icient SG3 
proviral clone into 293T cells. Two days after 
transfection, the pseudovirions were harvested and 
infected into JC53BLi-13 cells. The infectious units 
(IU) were determined by counting the blue cells 
after staining with X-gal in three independent 
experiments. When compared with CON6 env clone, CONs 
env clones produce similar number of IU in JC53BL-13 
cells (Figure 16) . The IU titers for both are about 
3 log higher than the SG3 backbone clone control (No 
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Env) . However, the titers are also about 2 log 
lower than the positive control (the native HIV-1 
env gene, NL4-3 or YU2) . These data suggest that 
both consensus group M env clones are biologically 
functional. Their functionality, however, has been 
compromised. The functional consensus env genes 
indicate that these Env proteins fold correctly, 
preserve the basic conformation of the native Env 
proteins, and are able to be developed as universal 
Env immunogens . 

It was next determined what coreceptor CONs Env 
uses for its entry into JC53-BL cells. When treated 
with CXCR4 blocking agent AMD3100, the infectivity 
of NL4-3 Env-pseudovirons was blocked while the 
infectivity of YU2 , CONs or CON6 Env-pseudovirons 
was not inhibited. In contrast, when treated with 
CCR5 blocking agent TAK779, the infectivity of NL4-3 
Env-pseudovirons was not affected, while the 
infectivity of YU2 , CONs or CON6 Env-pseudovirons 
was inhibited. When treated with both blocking 
agents, the infectivity of all pseudovirions was 
inhibited. Taken together, these data show that the 
CONs as well as CON6 envelope uses the CCR5 but not 
CXCR4 co-receptor for its entry into target cells. 

It was next determined whether CON6 or CONs Env 
proteins could be equally efficiently incorporated 
in to the pseudovirions. To be able precisely 
compare how much Env proteins were incorporated into 
the pseudovirions, each pseudovirions is loaded on 
SDS-PAGE at the same concent raion : 5|ig total protein 



59 



for cell lysate, 25ng p24 for cell culture 
supernatant, or 150ng p24 for purified virus stock 
(concentrated pseudovirions after super- speed 
centrif ugation) . There was no difference in amounts 
5 of Env proteins incorporated in CON6 or CONs Env- 

pseudovirions in any preparations (cell lysate, cell 
culture supernatant or purified virus stock:) (Figure 
17) . 

EXAMPLE 5 

io Synthesis of a Consensus Subtype A Full Length env 

(A. con. env) Gene 

Subtype A viruses are the second most prevalent 
HIV-1 in the African continent where over 70% of 

15 HIV-1 infections have been documented. Consensus 
gag, env and nef genes for subtype C viruses that 
are the most prevalent viruses in Africa and in the 
world were previously generated. Since genetic 
distances between subtype A and C viruses are as 

20 high as 30% in the env gene, the cross reactivity or 
protection between both subtypes will not be 
optimal. Two group M consensus env genes for all 
subtypes were also generated. However, to target 
any particular subtype viruses, the subtype specific 

25 consensus genes will be more effective since the 
genetic distances between subtype consensus genes 
and field viruses from the same subtype will be 
smaller than that between group M consensus genes 
and these same viruses. Therefore, consensus genes 

30 need to be generated for development of subtype A 
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specific immunogens . The codons of the A. con. env 
gene were optimized based on the codon usage of 
highly expressed human genes. (See Figs, 18A and 
18B for amino acid and nucleic acid sequences, 

5 respectively.) 

Each pair of the oligos has been amplified, 
cloned, ligated and sequenced. After the open 
reading frame of the A. con env gene was confirmed by 
an in vitro transcription and translation system, 

10 the A. con env gene was transfected into the 2 93T 
cells and the protein expression and specificity 
confirmed with the Western blot assay (Figure 18) . 
It was then determined whether A. con envelope is 
biologically functional. It was co- transfected with 

15 the env-def ective SG3 proviral clone into 293T 

cells. The pseudotyped viruses were harvested and 
used to infect JC53BL cells. Blue cells were 
detected in JC53-BL cells infected with the A. con 
Env-pseudovirions , suggesting that A. con Env protein 

20 is biologically functional (Table 2) . However, the 
infectious titer of A. con Env-psuedovirions was 
about 7- fold lower than that of pseudovirions with 
wild- type subtype C envelope (Table 2) . Taken 
together, the biological function A. con Env proteins 

25 suggests that it folds correctly and may induce 

linear and conformational T and B cell epitopes if 
used as an Env immunogen. 
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JC53BL13 (lU/ul) 





3/31/03 


4/7/03 


4/25/03 




non filtered supt. 


0.22pm filtered 


0.22Mm filtered 


A.con +SG3 


4 


8.5 


15.3 


96ZM651 +SG3 


87 


133 


104 


SG3 backbone 


0 


0.07 


0.03 


Neg control 


0 


0.007 


0 



Table 2. Infectivity of pseudovirons with A.con env genes 



EXAMPLE 6 

Design of Full Length "Consensus of the Consensus 
5 Sf^gr, pol and nef Genes 11 (M.con.gag, M. con. pol and 

M.con.nef) and a Subtype C Consensus pol Gene 

(C . con. pol) 

For the group M consensus genes, two different 
10 env genes were constructed, one with virus specific 
variable regions (CON6) and one with consensus 
variable regions (CONs) . However, analysis of T 
cell immune responses in immunized or vaccinated 
animals and humans shows that the env gene normally 
15 is not a main target for T cell immune response 
although it is the only gene that will induce 
neutralizing antibody. Instead, HIV-1 Gag, Pol and 
Nef proteins are found to be important for inducing 
potent T cell immune responses. To generate a 
20 repertoire of immunogens that can induce both 

broader humoral and cellular immune responses for 
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all subtypes, it may be necessary to construct other 
group M consensus genes other than env gene alone. 
"Consensus of the consensus 11 gragr, pol and nef genes 
(M. con. gag. , M. con. pol and M. con. nef ) have been 
5 designed. To generate a subtype consensus pol gene, 
the subtype C consensus pol gene (C. con. pol) was 
also designed. The codons of the M. con. gag., 
M. con. pol , M. con. nef and C. con. pol. genes were 
optimized based on the codon usage of highly 
10 expressed human genes. (See Fig. 19 for nucleic 
acid and amino acid sequences.) 

EXAMPLE 7 

Synthetic Subtype B Consensus gag and env Genes 

EXPERIMENTAL DETAILS 

15 Subtype B consensus gragr and env sequences were 

derived from 3 7 and 13 7 contemporary HIV-1 strains, 
respectively, codon-usage optimized for mammalian 
cell expression, and synthesized (Figs. 20A and 
20B) . To ensure optimal expression, a Kozak 

20 sequence (GCCGCCGCC) was inserted immediately 

upstream of the initiation codon. In addition to 
the full-length env gene, a truncated env gene was 
generated by introducing a stop codon immediately 
after the gp41 membrane -spanning domain (IVNR) to 

25 create a gpl45 gene. Genes were tested for 

integrity in an In vitro transcription/translation 
system and expressed in mammalian cells. (Subtype B 
consensus Gag and Env sequences are set forth in 
Figs. 20C and 20D, respectively.) 
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To determine if the subtype B consensus 
envelopes were capable of mediating fusion and 
entry, gpl60 and gpl45 genes were co- transf ected 
with an HIV-l/SG3Aenv provirus and the resulting 
5 pseudovirions were tested for infect ivity using the 
JC53-BL cell assay. JC53-BL cells are a derivative 
of HeLa cells that express high levels of CD4 and 
the HIV-1 coreceptors CCR5 and CXCR4 . They also 
contain the reporter cassettes of luciferase and [$- 

10 galactosidase that are each expressed from an HIV-1 
LTR. Expression of the reporter genes is dependent 
on production of HIV-1 Tat. Briefly, cells are 
seeded into 24-well plates, incubated at 37°C for 24 
hours and treated with DEAE - Dext ran at 3 7 °C for 

15 3 0min. Virus is serially diluted in 1% DMEM, added 
to the cells incubating in DEAE - dext ran , and allowed 
to incubate for 3 hours at 37 °C after which an 
additional SOO/^L of cell media is added to each 
well. Following a final 48-hour incubation at 37°C, 

20 cells are fixed, stained using X-Gal, and overlaid 
with PBS for microscopic counting of blue foci. 
Counts for mock- infected wells, used to determine 
background, are subtracted from counts for the 
sample wells. Co-receptor usage and envelope 

25 neutralization sensitivity were also determined with 
slight modifications of the JC53-BL assay. 

To determine whether the subtype B consensus 
Gag protein was capable of producing virus -like 
particles (VLPs) that incorporated Env 

30 glycoproteins, 293T cells were co- transf ected with 



64 



766537 



subtype B consensus gag and env genes. 48-hours 
post-transfection, cell supernatants containing VLPs 
were collected, clarified in a tabletop centrifuge, 
filtered through a 0 . 2mM filter, and pellet through 
5 a 2 0% sucrose cushion. The VLP pellet was 

resuspended in PBS and transferred onto a 20-60% 
continuous sucrose gradient. Following overnight 
cent rifugat ion at 100,000 x g, 0.5 ml fractions were 
collected and assayed for p24 content. The 

10 refractive index of each fraction was also measured. 
Fractions with the correct density for VLPs and 
containing the highest levels of p24 were pooled and 
pellet a final time. VIjP- containing pellets were 
re -suspended in PBS and loaded on a 4-20% SDS-PAGE 

is gel. Proteins were transferred to a PVDF membrane 
and probed with serum from a subtype B HIV-1 
infected individual . 

RESULTS 

20 Codon-usage optimized, subtype B consensus 

envelope (gpl60, gpl45) and gag genes express high 
levels of glycoprotein in mammalian cells (Fig. 21) . 

Subtype B gpl60 and gpl4 5 glycoproteins are 
efficiently incorporated into virus particles. 

25 Western Blot analysis of sucrose-purified 

pseudovirions suggests at least five- fold higher 
levels of consensus B envelope incorporation 
compared to incorporation of a rev-dependent 
contemporary envelope (Fig.23A). Virions 

30 pseudo typed with either the subtype B consensus 
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gp!60 or gpl4 5 envelope are more infectious than 
pseudovirions containing a rev-dependent 
contemporary envelope (Fig. 23 B) . 

Subtype B consensus envelopes utilize CCR5 as 
5 the co-receptor to gain entry into CD 4 bearing 
target cells (Fig. 22) . 

The infectivity of pseudovirions containing the 
subtype B consensus gpl60 envelope was neutralized 
by plasma from HIV-1 subtype B infected patients 
10 (Fig. 24C) and neutralizing monoclonal antibodies 
(Fig. 24A) . This suggests that the subtype B 
synthetic consensus B envelopes is similar to native 
HIV-1 Env glycoproteins in its overall structure and 
that common neutralization epitopes remain intact. 
15 Figs. 24B and 24D show neutralization profiles of a 
subtype B control envelope (NL4 . 3 Env). 

Subtype B consensus Gag proteins are able to 
bud from the cell membrane and form virus -like 
particles (Fig. 25A) . Co- transf ect ion of the codon- 
20 optimized subtype B consensus gag and gpl60 genes 

produces VLPs with incorporated envelope (Fig. 25B) . 

CONCLUSIONS 

The synthetic subtype B consensus env and gag 
genes express viral proteins that are similar in 
25 their structure, function and antigenicity to 

contemporary subtype B Env and Gag proteins. It is 
contemplated that invmunogens based on subtype B 
consensus genes will elicit CTL and neutralizing 
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immune responses that are protective against a broad 
set of HIV-l isolates. 

* * * 

All documents and other information sources 
5 cited above are hereby incorporated in their 
entirety by reference. 
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ABSTRACT 



The present invention relates, in general, to 
an immunogen and, in particular, to an immunogen for 
inducing antibodies that neutralize a wide spectrum 
of HIV primary isolates and/or to an immunogen that 
induces a T cell immune response. The invention 
also relates to a method of inducing anti-HIV 
antibodies, and/or to a method of inducing a T cell 
immune response, using such an immunogen. The 
invention further relates to nucleic acid sequences 
encoding the present immunogens . 
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I 0 : la , 

'TETDNE EIKKCSFNXTTELRDKKOKVryALF^^ 

KVSFEPIPIHYCAPAGFAlLKCNDKltFNGTGPCKNVSTVQCTHGIKPVV 

V3 k 

i : I 

XTNNAKTI IVQLNESVEINCTRPNUNTRKS IHIGPGQAFYATGEI IGDIRQAHCNISRTKWNKTLQQVAK 

V4 



| " 1 

KLREHFNKKT IIFKPS SGGDLE I TTHS FNCGGEFF YCNT S GLFNSTWMFNGT YMFNGTKDNS ET ITLPCR 

_V5_ 



IKQIINMWQGVGQAMYAPPIEGKITCKSNITGLLLTRD^ 

VVKIEPLGVAPTKAKRRVVEREKI^VGIGAVFLGFIjGAAGSTMGAASITLTVQARQ 

aieaqqhllqltvwgikqlqarvxaverylkdqqllgiwgc^ 

WMEWER^ISNYTDI IYRIjIEESQNQQEKNE 

VFAVXjSIVNRVT^QGYSPIjSFQTIjIPNPRGPDRPEGIEEEGGEQGRDRSIRLVNGFIiAIiAWDDIiRSIjCXjF^ 

YHRLRDFILIAARTVELIiGRRSLRGIi^GWEAIiKYIiGNLIiQYW 

EIVQRACRAILNIPRRIRQGLERALL 



gplGO □ 
gp140CF □ 
BP120 C 



c 

gp120 gp140CF 




MW 

198 kd 
115 kd 

93 kd 



CONS.env (group M env consensus. This one contain five variable regions in env gene from 98CN006 vino, not in the 

CTGGGCATGCTGATGATCTGCTCCGCCGCCGAGAACCTGTGGGTGACCGTGTACTACGGC 

GTGCCCGTGTGGAAGGAGGCCAACACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTAC 

GACACCGAGGTGCACAAGGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCC 

CAGGAGATCGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAACAACATGGTG 

GAGCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAG 

CTGACCCCCCTGTGCGTGACCCTGAACTGCACCAACGTGCGCAACGTGTCCTCCAACGGC 

ACCGAGACCGACAACGAGGAGATCAAGAACTGCTCCTTCAACATCACCACCGAGCTGCGC 

GACAAGAAGCAGAAGGTGTACGCCCTGTTCTACCGCCTGGACGTGGTGCCCATCGACGAC 

AAGAACTCCTCCGAGATCTCCGGCAAGAACTCCTCCGAGTACTACCGCCTGATCAACTGC 

AACACCTCGGCCATCACCCAGGCCTGCCCCAAGGTGTCCTTCGAGCCCATCCCCATCCAC 

TACTGCGCCCCCGCCGGCTTCGCCATCCTGAAGTGCAACGACAAGAAGTTCAACGGCACC 

GGCCCCTGCAAGAACGTGTCCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGGTGTCC 

ACCCAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTCCGAGAAC 

ATCACCAACAACGCCAAGACCATCATCGTGCAGCTGAACGAGTCCGTGGAGATCAACTGC 

ACCCGCCCCAACAACAACACCCGCAAGTCCATCCACATCGGCCCCGGCCAGGCCTTCTAC 

GCCACCGGCGAGATCATCGGCGACATCCGCCAGGCCCACTGCAACATCTCCCGCACCAAG 

TGGAACAAGACCCTGCAGCAGGTGGCCAAGAAGCTGCGCGAGCACTTCAACAACAAGACC 

ATCATCTTCAAGCCCTCCTCCGGCGGCGACCTGGAGATCACGACCCACTCCTTCAACTGC 

GGCGGCGAGTTCTTCTACTGCAACACCTCCGGCCTGTTCAACTCCACCTGGATGTTCAAC 

GGCACCTACATGTTCAACGGCACCAAGGACAACTCCGAGACCATCACCCTGCCCTGCCGC 

ATCAAGCAGATCATCAACATGTGGCAGGGCGTGGGCCAGGCCATGTACGCCCCCCCCATC 

GAG G GCAAGATC ACCTGC AAGTCCAAC ATCACCGGCCTGCTGCTGACCCGCGACGGCGGC 

AAC AACTCCAACAAGAAC AAGACCGAG ACCTTCCGCCCCGGCG G CGGC GACATGCGCGAC 

AACTGGCGCTCCGAGCTGTACAAGTACAAGGTGGTGAAGATCGAGCCCCTGGGCGTGGCC 

CCCACCAAGGCCAAGCGCCGCGTGGTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCC 

GTGTTCCTGGGCTTCCTGGGCGCCGCCGGCTC CACCATGGGCGCCGCCTCC ATCAC CCTG 

ACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGTCCAACCTGCTGCGC 

GCC ATCGAGGCCCAGCAGCACCTGCTGCAGCTG ACCGTGTGGGGCATCAAG C AG CTGCAG 

GCCCGCGTGCTGGCCGTGGAGCGCTACCTGAAGGACCAGCAGCTGCTGGGCATCTGGGGC 

TGCTCCGGCAAGCTGATCTGCACCACCAACGTGCCCTGGAACTCCTCCTGGTCCAACAAG 

TCCCAGGACGAGATCTGGGACAACATGACCTGGATGGAGTGGGAGCGCGAGATCTCCAAC 

TACACCGACATCATCTACCGCCTGATCGAGGAGTCCCAGAACCAGCAGGAGAAGAACGAG 

CAlGGAGCTGCTGGCCCTGGACAAGTGGGCCTCCCTGTGGAACTGGTTCGACATCACCAAC 

TGGCTGTGGTACATCAAGATCTTCATCATGATCGTGGG CGGCCTGATCGGCC TGCGC ATC 

GTGTTCGCCGTGCTGTCCATCGTGAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTC 

CAGACCCTGATCCCCAACCCCCGCGGCCCCGACCGCCCCGAGGGCATCGAGGAGGAGGGC 

GGCGAGCAGGGCCGCGACCGCTCCATCCGCCTGGTGAACGGCTTCCTGGCCCTGGCCTGG 

GACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCGCCTGCGCGACTTCATCCTGATC 

GCCGCCCGCACCGTGGAGCTGCTGGGCCGCCGCTCCCTGCGCGGCCTGCAGAAGGGCTGG 

GAGGCCCTGAAGTACCTGGGCAACCTGCTGCAGTACTGGGGCCAGGAGCTGAAGAACTCC 

GCCATCTCCCTGCTGGACACCACCGCCATCGCCGTGGCCGAGGGCACCGACCGCGTGATC 

G AG ATCGTGC AGCGCGCCTGCCGCGCCATCCTGAACATCCCCCGC CGC ATCCG CCAGGG C 

CTGGAGCGCGCCCTGCTGTAA 
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C ancenv (subtype C ancestral env. The amino acid sequence is different from Los Alamos Database August 2002) 

GCCGCCATGCGCGTGATGGGCATCCTGCGCAACTGCCAGCAGTGGTGGAT 

CTGGGGCATCCTGGGCTTCTGGATGCTGATGATCTGCTCCGTGGTGGGCA 

ACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGAGGCCAAG 

ACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGAGCGCGAGGTGCA 

CAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCCAGG 

AGATGGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAACGAC 

ATGGTGGACCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCT 

GAAGCCCTGCGTGAAGCTGACCC CCCTGTGC GTGAC CCTGAACTGCACCA 

ACGTGACCAAC G CCACC AACAACAC CTAC AACGGCGAGATGAAGAACTGC 

TCC TTCAA C AT CAC CACCGAG CTGCGC GA CAAGAAGAAGAAG G AGT ACGC 

CCTGTTCTACCGCCTGGACATCGTGCCCCTGAACGAGAACTCCTCCGAGT 

ACCGCCTGATCAACTGCAACACCTCCGCCATCACCCAGGCCTGCCCCAAG 

GTGTCCTTCGACCCCATCCCCATCCACTACTGCGCCCCCGCCGGCTACGC 

CATC CTGAAGTGCAACAACAAGACCTTC AACGGCACCGGCC CCTGCAACA 

AC GTGTCCACCGTGCAGTGCAC CCACGGCATCAAGCCCGTGGTGTCCACC 

CAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTC 

CGAGAACCTGACCGACAACGCCAAGACCATCATCGTGCAGCTGAACGAGT 

CCGTGGAGATCGTGTGCACCCGCCCCAACAACAACACCCGCAAGTCCATG 

CGCATCGGCCCCGGCCAGACCTTCTACGCCACCGGCGACATCATCGGCGA 

CATC C GC CAGGCC C ACTGCAACATCTC CGAGGACAAGTGGAACAAGACCC 

TGCAGCAGGTGGCCGAGAAGCTGGGCAAGCACTTCCCCAACAAGACCATC 

ACCTTCGAGCCCTCCTCCGGCGGCGACCTGGAGATCACCACCCACTCCTT 

CAACTGCCGCGGCGAGTTCTTCTACTGCAACACCTCCAAGCTGTTCAACT 

CCACCTACAACAACAACACCAACTCCAACTCCACCATCACCCTGCCCTGC 

CGCATCAAGCAGATCATCAACATGTGGCAGGGCGTGGGCCAGGCCATGTA 

CGCCCCCCCCATCGCCGGCAACATCACCTGCAAGTCCAACATCACCGGCC 

TGCTGCTGACCCGCGACGGCGGCAAGGAGAACACCACCGAGACCTTCCGC 

CCCGGCGGCGGCGACATGCGCGACAACTGGCGCTCCGAGCTGTACAAGTA 

CAAGGTGGTGGAGATCAAGCCCCTGGGCGTGGCCCCCACCGAGGCCAAGC 

GCCGCGTGGTGGAGCGCGAGAAGCGCGCCGTGGGCCTGGGCGCCGTGTTC 

CTGGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCAC 

CCTGACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGT 

CCAACCTGCTGCGCGCCATCGAGGCCCAGCAGCACATGCTGCAGCTGACC 

GTGTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTGGCCATGGAGCGCTA 

CCTGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGA 

TCTGCACCACCGCCGTGCCCTGGAACTCCTCCTGGTCCAACAAGTCCCTG 

GACGACATCTGGGACAACATGACCTGGATGGAGTGGGACCGCGAGATCTC 

CAACTACACCGACACCATCTACCGCCTGCTGGAGGAGTCCCAGAACCAGC 

AGGAGAAGAACGAGCAGGACCTGCTGGCCCTGGACTCCTGGGAGAACCTG 

TGGAACTG G TTC GACATC AC CAAC TG GCTGTGGTAC ATC AAGATCTTC AT 

CATGATCGTGGGCGGCCTGATCGGCCTGCGCATCATCTTCGCCGTGCTGT 

CCATCGTGAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACC 

CTGACCCCCAACCCCCGCGGCCCCGACCGCCTGGAGCGCATCGAGGAGGA 

GGGCGGCGAGCAGGACCGCGACCGCTCCATCCGCCTGGTGTCCGGCTTCC 

TGGCCCTGGCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCAC 

CGCCTGCGCGACTTCATCCTGATCGCCGCCCGCACCGTGGAGCTGCTGGG 

CCGCTCCTCCCTGCGCGGCCTGCAGCGCGGCTGGGAGGCCCTGAAGTACC 

TGGGCTCCCTGGTGCAGTACTGGGGCCAGGAGCTGAAGAAGTCCGCCATC 

TCCCTGCTGGACACCATCGCCATCGCCGTGGCCGAGGGCACCGACCGCAT 
CATCGAGGTGGTGCAGCG CGCCTGC CGCGCCATCCTGAACATCCCC CGCC 

GCATCCGCCAGGGCTTCGAGGCCGCCCTGCTGTAA 



C.con.env (subtype C consensus env. The amino acid sequence is different from Los Alamos Database August 2002) 

GCCGC CATGCGCGTGATGGGCATCCTG CGCAACTGCCAGCAGTGGTGGAT 

CTGGGGCATCCTGGGCTTCTGGATGCTGATGATCTGCAACGTGGTGGGCA 

ACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGAGGCCAAG 

ACCACCC TGTTCTGCGCCTCCGACGCCAAGGCCTAC GAGAAGGAGGTGCA 

CAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCCAGG 

AGATGGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAACGAC 

ATGG TGGAC CAGATG CACGAG GACATCATCTC CCTG TGG GACCAGTC CCT 

GAAGCCCTGCGTGAAGCTGACCCC CCTGTGC GTGACCCTGAACTGCCGCA 

ACGTGACCAACGCCACCAACAACACCTACAACGAGGAGATCAAGAACTGC 

TCCTTCAACATCACCACCGAGCTGCGCGACAAGAAGAAGAAGGTGTACGC 

CCTGTTCTACCGCCTGGACATCGTGCCCCTGAACGAGAACTCCTCCGAGT 

AC CGC CTGATCAAC TGC AACACCTC CGCCATCACCCAG G CCTGC CCCAAG 

GTGTCCTTCGACCCCATCCCCATCCACTACTGCGCCCCCGCCGGCTACGC 

CATCCTGAAGTGCAACAACAAGACCTTCAACGGCACCGGCCCCTGCAACA 

ACGTGTCCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGGTGTCCACC 

CAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTC 

CGAGAACCTGACCAACAACGCCAA6ACCATCATCGTGCACCTGAACGAGT 

CCGTGGAGATCGTGTGCACCCGCCCCAACAACAACACCCGCAAGTCCATC 

CGCATCGGCCCCGGCCAGACCTTCTACGCCACCGGCGACATCATCGGCGA 

CATCCGCCAGGCCCACTGCAACATCTCCGAGGACAAGTGGAACAAGACCC 

TG CAGCGC GTGTCCAAGAAGCTGAAGGAGCAC7TC CC CAACAAGACCATC 

AAGTTCGAGCCCTCCTCCGGCGGCGACCTGGAGATCACCACCCACTCCTT 

CAACTGCCGCGGCGAGTTCTTCTACTGCAACACCTCCAAGCTGTTCAACT 

CCACCTACAACAACAACACCAACTCCAACTCCACCATCACCCTGCCCTGC 

CGCATCAAGCAGATCATCAACATGTGGCAGGAGGTGGGCCGCGCCATGTA 

CGCCCCCCCCATCGCCGGCAACATCACCTGGAAGTCCAACATCACCGGCC 

TGCTGCTGACCCGCGACGGCGGCAAGAAGAACACCACCGAGATCTTCCGC 

CCC GGCG GCGGGGACATGCG CGACAACTGGCGGTCCGAGCTGTACAAGTA 

CAAGGTGGTGGAGATCAAGC C CCTGGGC GTGGCCCC CA CCAAGGCCAAG C 
GCCGCGTGGTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCGTGTTC 

CTGGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCAC 

CCTGACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGT 

CC AAC CTGCTGC GC GCCATCGAGGCC C AGC AGC ACATGCTGCAGCTGACC 

GTGTGGGGCATCAAGCAGCTGCAGACCCGCGTGCTGGCCATCGAGCGCTA 

CCTGAAGGACC AGCAGCTGCTGGGCATCTGGGG C TGCTCCGGC AAGCTGA 

TCTGCACCACCGCCGTGCCCTGGAACTCCTCCTGGTCCAACAAGTCCCAG 

GAGGACATCTGGGACAACATGACCTGGATGCAGTGGGACCGCGAGATCTC 

CAACTACACCGACACCATCTACCGCCTGCTGGAGGACTCCCAGAACCAGC 

AGGAGAAGAACGAGAAGGACCTGCTGGCCCTGGACTCCTGGAAGAACCTG 

TGGAACTGGTTCGACATCACCAACTGGCTGTGGTACATCAAGATCTTCAT 

CATGATCGTGGGCGGCCTGATCGGCCTGCGCATCATCTTCGCCGTGCTGT 

CC ATC GTGAACCGC GTGC G CC AGGGCTACT C CC CCCTGTCCTTCCAGAC C 

CTGACCCCCAACCCCCGCGGCCCCGACCGCCTGGGCCGCATCGAGGAGGA 

GGGCGGCGAGCAGGACCGCGACCGCTCCATCCGCCTGGTGTCCGGCTTCC 

TGGCCCTGGCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCAC 

CGCCTGCGCGACTTCATCCTGGTGGCCGCCCGCGCCGTGGAGCTGCTGGG 

CCGCTCCTCCCTGCGCGGCCTGCAGCGCGGCTGGGAGGCCCTGAAGTACC 

TGGGCTCCCTGGTGCAGTACTGGGGCCTGGAGCTGAAGAAGTCCGCCATC 

TCCCTG CTGGACACCATCGCGATCGCCGTGGCC GAGGGCACCGAC CG CAT 

CATCGAGCTGATCCAGCGCATCTGCCGCGCCATCCGCAACATCCCCCGCC 

GCATCCGCCAGGGCTTCGAGGCCGCCCTGCAGTAA 



MRVMGILRNCQQWWIWGILGFWMLMICSWGMLWVTVYYGVPVW 
HACVPTDPNPQEMVLEN\n^NFNMVVKNDRWDQMHEDllSL^ 

YNGEMKN CS FN1TTEUIDKKKKEYALFYRLDIVP LNENS SPfRU NCNTSAITQAC PK^ FDP^ WCA 

PAGYA1U<CNMKTFNGTGPCNNV^™QCTHGIK^^ 

ESVEIVCTRPNN^KSMRIGPGQTFYATGDUGDIRC^HCNISEDKW 

EPSSGGDLEITTHSFNCRGEFFYCNTSKLFNSTYNNNTNSNSTITIFCRIK^ 

GWTCKSNITGLLLTROGGKENTnrTFRPGGG^ 

AVGLGAWLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSN LLRA1 EAQQH MLQLTVWG1 KQLOARVL 
AME RYL KDQQLIGIWGCSG KU CTTAVPWNSSWSN KS LDD IWDNMTWM EWDRE 1 SN YTDTJ YRLLEES QN 
QQ EKN EQDLLALOSWEN L WN WFDITNWLWYIK1F1M IVGGUGLR 1 1 PAVLSIVhWN^QGYSPLSFQT^T 
PNPRGPDRLERIEEEGGEQORDRSIRLVSGFlAlAWDDLRSLCLFSYHRLRDFILtAAKTV^LGR^LR 
GLQRGWEAU<YUGSLVQYWGQELKKSAlSLLOTlMAVA£GTDRHEWQRACRAILNlPRRIRQGFEAA 



K (oJt> 



C.con.env (subtype C consensus env) 

MRVMGIUWCQQWWIWGILGFWMLMICNWGNLVWTVYYGV^^ 
I^CVPTOPNPQEMVLENVTENFNMWKNDMVDQMHEDIISLWDQSLKP 
YNEEIKNCSFNITTCLROKKiaCWALFYRLDW^ 
PAGYAJLKCNNKTFNGTGPCNNVSTVQCTmiKPWSTGUJJiGStA 

ESVEIVCTRPNNNTRKSlRfGPGQTFyATGDIIGDIRQAHCMlSEDKWNKTLQRVSKKLKEHFPNKTlKF 

EPSSGGDl^nTHSFNCRGEFFV'CNTSKLFNSTYNNhrmSNSTITUPCRJKQIINMV^EVGRAMYAPPIA 

GNITCKS^TGU^LTRDGGKKNTTElFRPGGGDMRDNWRSELYKYKWEtKPtGVAPTKAKRRWEREKR 

AVGlGAVFLGFLGAAGSTMGAASrTLTVC^RQLLSGIVQGQ 

AiERYLKDCK3UGIVVGCSGKUCTTAWWNSS\A«NKSQE0IW0NMTWM 

QQEKNEKDU^ODSWKNLWNWFDJTNWLWYIKIFIMIVGGUGLRIIFAV^SIVNRVRQGY LSFQTLT 

PNPRGPDRLGRIEEEGGEQORDRSIRLVSGFLALAWDDLRSLCLFSYHRLRDFILVAARAVELLGRSSLR 

GLQRGWEAUCYLGSLVGYWGLEUKSAISLLDTlAlAVAEGTDRitELlQRICRAIRNlPRRiRGGFEAA 



Synthesize entire gene in 80-mer fragments overlapping by 20 residues at the 3' end with invariant 

sequences at the 5* end. 



EcoRI Bbsl 



80mer fragment 



BsmBj BamHl 



80mer fragment 



Invariant Seq. 
(18 nt) 



invariant Seq. 



Paired BOmer oligos are connected via PCR in a stepwise manner from 5' to 3' using 
primers complimentary to the invariant seq. 

BHHl 80mer Fragment 



5*primer 



140bp PCR product 



3'primer 



108bp PCR fragments cloned into pGEM-T and sequenced. Clones with the proper sequence will be 
cut with 2 restriction enzymes. 4 fragments will be ligated together with pcDNA3.1 in a stepwise 

manner from the 5* to 3* end of gene 



Fragments to be ligated with 
pcDNA3.1 

(1-4 are in order from 5* to 3') 


Restriction Enzymes Used 
to Cleave Fragment 


Fragment 1 


EcoRI/BsmBI 


Fragment 2 


Bbsl/BsmBJ 


Fragment 3 


Bbsl/BsmBI 


Fragment 4 


Bbsl/BamHI 


pcDNA3.1 


EcoRI/BamHI 



Fragment 2 



Fragment 3 




Fragment 4 
BamHl 



i 

Ligations will be repeated stepwise 5' to 3' until the entire gene has been cloned 

intopcONA3.1 



pCONA3.1-env 

CUV &5oRJl 



BamHl 



pSG3* 




Pseudotyped virions 

fcfc P»asma or co-receptor Inhibitor) 



JC53BL Indicator cells 
Blue foci counting / Luciterase activity 



125.00% 



Cane gp160 



ZD 

OC 50.00% 



0.00% 



10-7 



125-00% 



10« 

Coon gp160 




10* 10* 

Plasma Dilution 



10* 



10* 




125.00% 



10* 

Zm651 gp160-opt 



10* 10^ 

Plasma Dilution 



10-' 




105 10^ 

Plasma Dilution 



10* 



Plasma from HIV- ! subtype C infected patients Plasma from uninfected donors 



Gag 



4> 




Cell lysate Supernatant 



p33 
p22 



Nef 



c§^0 4^<§^ O 




JL 



Cell lysate Supernatant 



Y\<\wt feC_ 



C .con .gag (subtype C consensus gag) 

MGARASILRGGKLDTWEKIRU?PGGKKRYMiKHLNA/VASRELERFALNPGLLETSEGCKQJMKQLQPA 
LQTGTEELRSLYNTVATLYCWEKJEVRDTKEALDK1EEEQNKSQQKTQQAEAAADGKVSQNYPI 
VQNLQGQMVHQAISPRTLNAWVKVIEEKAFSPEVIPMFTALSEGATPQDLNTM^ 
INEEAAEWORIHPVHAGPIAPGQMREPRGSDIAGTTS^ 

RMYSPVSILDIKQGPKEPFRDYVORFFKTLRAEQATQDVKNWMTDTLLVQNANPDCKTILR^ 
EMMTACQGVGGPSHKARVUVEAMSQANNTNIMMQRSNFKGPKRJVKCFNCG^GHIARNCRAPRKKGCWK 
CGKEGHQMKDCTERQANFIGWWPSHKGRPGNFLQSRPEPTAPPAESFRFEETTPA 
P KQEP KDREPLTSLKSLFGSDPUSQ 



MGGKWSKSSIVGWAVRERIRRTEPAAEGVGAASQDLI^GALTSSNTATNNAOCAWLEAQEEEEEV 

GFPVRPQVPLRPMTYKAAFDLSFFLKEKGGLEGUYSKKRQEI^ 

LTFGVWFKLVPVDPREVEEANEGENNCLLHPMSGHGMED^^ 




C. con .gag (subtype C consensus gag. Not in the public domain) 

gccgccgccatgggcgcccgcgccagcatcctgcgcggcggcaagctggacacctgggagaagatccgcc 

tgcgccccggcggcaagaagcgctacatgatcaagcacctg6tgtgggccagccgcgagctggagcgctt 

cgccctgaaccccggcctgctggagaccagcgagggctgcaagcagatcatgaagcagctgcagcccgcc 

ctgcagaccggcaccgaggagctgcgcagcctgtacaacaccgtggccaccctgtactgcgtgcacgaga 

agatcgaggtgcgcgacaccaaggaggccctggacaagatcgaggaggagcagaacaagagccagcagaa 

gacccagcaggccgaggccgccgccgacggcaaggtgagccagaactaccccatcgtgcagaacctgcag 

ggccagatggtgcaccaggccatcagcccccgcaccctgaacgcctgggtgaaggtgatcgaggagaagg 

ccttcagccccgaggtgatccccatgttcaccgccctgagcgagggcgccaccccccaggacctgaacac 

catgctgaagaccgtgggcggccaccaggccgccatgcagatgctgaaggacaccatcaacgaggaggcc 

gccgagtgggaccgcctggaccccgtgcacgccggccccatcgcccccggcgagatgcgcgagccccgcg 

gcagcgacatcgccggcaccaccagcaccctgcaggagcagatcgcctggatgaccagcaacccccccgt 

g ccc gtgggc gacatctac aagcgctggatgatcctgggcctgaac aagatcgtgcg c atgtacagc ccc 

gtgagcatcctggacatcaagcagggccccaaggagcccttccgcgactacgtggaccgcttcttcaaga 

ccctgcgcgccgagcaggccacccaggacgtgaagaactggatgaccgacaccctgctggtgcagaacgc 

caaccccgactgcaagaccatcctgcgcgccxtgggccccggcgccagcctggaggagatgatgaccgcc 

tgccagggcgtgggcggccccagccacaaggcccgcgtgctggccgaggccatgagccaggccaacaaca 

c caac atcatgatgcagcgc agcaacttcaagggcc ccaagc gc atc gtgaagtgcttcaactgcggcaa 

ggagggccacatcgcccgcaactgccgcgccccccgcaagaagggctgctggaagtgcggcaaggagggc 

cac cag atgaaggactgcaccgagggccaggccaacttcctgggcaagatctggcccagcc acaagggc c 

gccccggcaacttcctgc^gagccgccccgagcccaca^ccccccc^ccgagagcttccgcttcgagga 

gaccacccccgcccccaagcaggagcccaaggaccgcgagcccctgaccagcctgaagagcctgttcggc 

agc gacc ccctgag ccagtaa 

C.con.nef (subtype C consensus net. Not in the pubQc domain) 

gccgccgccatgggcggcaagtggagcaagagcagcatcgtgggctggcccgccgtgcgcgagcgcatcc 

gccgcaccgagcccgccgccgagggcgtgggcgccgccagccaggacctggacaagtacggcgccctgac 

(^gcagcaacaccgccaccaacaacgccgactgcgcctggctggaggcccaggaggaggaggaggaggtg 

ggcttccccgtgcgcccccaggtgcccctgcgccccatgacctacaaggccgccttcgacctgagcttct 

tcctgaaggagaagggcggcctggagggcctgatctacagcaagaagcgccaggagatcctggacctgtg 

ggtgtaccacacccagggcttcttccccgactggcagaactacacccccggccccggcgtgcgctacccc 

ctgaccttcggctggtgcttcaagctggtgcccgtggacccccgcgaggtggaggaggccaacgagggcg 

agaacaactgcctgctgcaccccatgagccagcacggcatggaggacgaggaccgcgaggtgctgaagtg 

gaagttcgacagccacctggcccgccgccacatggcccgcgagctgcaccccgagtactacaaggactgc 

tga 




CONs.env (gorup M consensus anv gene. This one contain tho consensus sequence for variable regions in env 

MRVRGIQRNCQHLWRWGTlILGMLMICSAAENLVWW 

WATHACVPTDPNPQBVLENNn'ENFNMWKNNMVEQM 

TTNNTEEKGEIKNCSFNrTTElRDKKQKVYALFYRtDWPIDDNNNNSSNYRLINCNTSAITQACPKVSF 

EP t PI HYCAPAG FAI LKCND KKFNGTGPC KNVSTVQCTHG I KPWSTQLLLNGSLAEES I IRS ENI TN N 

AKTIIVQLNESVEINCTRPNNNTRKSIRIGPGQAFYATGDIIGOIRQAHCNISGTKWNKTLQQVAKKLRE 

HFNNKTIlFKPSSGGDl^nTTHSFNCRGEFFYCNTSGU^NSTWIGNGTKNNNNTNOTlTLPCRIKQnNM 

VN^GVGQAMYAPPlEGKITCKSNITGLLLTRDGGNNhTTNETEIFRPGGGDMRDNWRSELYKYKNA/KlEPLG 

VAPTKAKRRWEREKRAVG1GAVFLGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLRAIEAQQHL 

LQLTW/GlKQLC^R\rtJWERYLKDQQLLGIWGCSG^^ 
NNYTDIIYSUEESQNQQEKNEQELIJVLOKWASLVWWDITNWLvWIW 

NRVRQGYSPLSFQTLIPNPRGPDRPEGIEEEGGEaORDRSIRLVNGFLALAWDDLRSLCLFSYHRLROFI 
U AAR7VELLGRKGLRRGWEALKYI.WN LLQYWGQ ELKNSAI S LLDTTAIAVAEGTDRVIEWQRACRAI L 
NIPRRIRQGLERALL 



CONs.env (gorvp M consensus env gene This one contain the consensus sequence tor vanabte reatons tn env gene. 

The Identical amino acid sequences as in the public domain) 

GCCC^CGCGATGCGCXTTGCGCGGCttTC^ 

GCGCTGGGGCACCCTGATCCTGGGCATGCTGATGATCTGCTCCGCCGCCG 

AGAACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGAGGCC 

AAC ACCAC C C TGTTC TGCGCCTCC GACG CCAAGGCCT AC GACACCGAGGT 

GCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 

AGGAGATC GTGCTGGAGAAC GTGAC C GAGAAC TTCAACATGTGGAAGAAC 

AACATGGTGGAGCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTC 

CCTGAAGCCCTGCGTGAAGCTGACCCCCCTGTGCGTGACCCTGAACTGCA 

CC AACGTGAACGTGAC CAACACCACC AAGAAC ACCGAGGAGAAGGGC6AG 

ATCAAGAACTGCTCCTTCAACATCACCACCGAGATCCGCGACAAGAAGCA 

GAAGGTGTACGCCCTGTTCTACCGCCTGGACGTGGTGCCCATCGACGACA 

ACAACAACAACTCCTCCAACTACCGCCTGATCAACTGCAACACCTCCGCC 

ATCACCCAGGCCTGCCCCAAGGTGTCCTTCGAGCCCATCCCCATCCACTA 

CTGCXSCCCCCGCCGGCTTCGCC^TCCTGAAGTGCAACGACAAGAAGTTCA 

ACGGCAC CG GCC C CTGCAAGAACGTGTCCAC CGTG CAGTG CACC CACGGC 

ATCAAGCCCGTGGTGTCCACCCAGCTGCTGCTGAACGGCTCCCTGGCCGA 

GGAGGAGATCATCATCCGCTCCGAGAACATCACCAACAACGCCAAGACCA 

TCATCGTGCAGCTGAACGAGTCCGTGGAGATCAACTGCACCCGCCCCAAC 

AACAAC AC CCGCAAGTCCATCCGCATCGGCCCCGGCGAGGCCTTCTACGC 

C ACCGGCG AC ATCATCGG CGAC ATC CGCCAGGCCCACTGCAACATCTCCG 

GCACCAAGTGGAACAAGACCCTGCAGCAGGTGGCCAAGAAGCTGCGCGAG 

CACTTCAACAACAAGACCATCATCTTCAAGCCCTCCTCCGGCGGCGACCT 

GGAGATCACCACCCACTCCTTCAACTGCCGCGGCGAGTTCTTCTACTGCA 

ACACCTCCGGCCTGTTCAACTCCACCTGGATCGGCAACGGCACCAAGAAC 

AACAAC AACACCAACGACACC ATC ACCCTGC CCTGCCGCATCAAG CAGAT 

CATCAACATGTGGCAG GGCGTGGGCCAGGCCATGTACGCCCCCCCCATC G 

AGGGCAAGATCACCTGCAAGTCCAACATCACCGGCCTGCTGCTGACCCGC 

GACGGCGGCAACAACAACACCAACGAGACCGAGATCTTCCGCCCCGGCGG 

CGGCGACATGCGCGACAACTGGCGCTCCGAGCTGTACAAGTACAAGGTGG 

TGAAGATCGAGCCCCTGGGCGTGGCCCCCACCAAGGCCAAGCGCCGCGTG 

GTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCGTGTTCCTGGGCTT 

CCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCACCCTGACCG 

TGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGTCCAACCTG 

CTGCGCGCCATCGAGGCCCAGCAGCACCTGCTGCAGCTGACCGTGTGGGG 

CATCAAGCAGCTGCAGGCCCGCGTGCTGGCCGTGGAGCGCTACCTGAAGG 

ACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGATCTGCACC 

AC CACCGTG C CCTGGAACTCCTCCTGGTCCAACAAGTCCC AGGACGAGAT 

CTGGGACAACATGACCTGGATGGAGTGGGAGCGCGAGATCAACAACTACA 

CCGACATCATCTACTCCCTGATCGAGGAGTCCCAGAACCAGCAGGAGAAG 

AACGAGCAGGAGCTGCTGGCCCTGGACAAGTGGGCCTCCCTGTGGAACTG 

GTTCGACATCACCAACTGGCTGTGGTACATCAAGATCTTCATCATGATCG 

TGGGCGGCCTGATCGGCCTGCGCATCGTGTTCGCCGTGCTGTCCATCGTG 

AACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACCCTGATCCC 

CAACCCCCGCGGCCCCGACCGCCCCGAGGGCATCGAGGAGGAGGGCGGCG 

AGCAGGACCGCGACCGCTCCATCCGCCT GG T GAACGGC TTCCTGGCCCTG 

GCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCGCCTGCG 

CGACTTCATCCTGATCGCCGCCCGCACCGTGGAGCTGCTGGGCCGCAAGG 

GCCTGCGCCGCGGCTGGGAGGCCCTGAAGTACCTGTGGAACCTGCTGCAG 

TACTG GGGCCAGGAGCTGAAGAACTCCGCCATCTCCCTGCTGGAC AC CAC 

CGCCATCGCCGTGGCCGAGGGCACCGACCGCGTGATCGAGGTGGTGCAGC 

GCGCCTGCCGCGCCATCCTGAACATCCCCCGCCGCATCCGCCAGGGCCTG 

GAGCGCGCCCTGCTGTAA 



gpl60 




Cell lysate Supernatant 

Expression of CONs envgene in mammalian cells 




"Vx c, lb 



A ^ 



C 









Lysate Supernatant BlAWV 



Erw protein incorporation in CON6 andCONs Env-pseudovirions 



Aeon.env (subtype A consensus env) 
MRVMGIQRNCQHLWRWGTMIL6MIIICSAAENLWS/T\^ 

WATHAC VPTDPN PQ EIN LENVTEE FNMWKNNMVEQW HTD1I S LWDQ SLKPCVKLTPLC VTLN CSNVNVTT 
NlTNrTDNMKGElKNCSFNWrTTCLRDKK^^ 

FEPtPIHYCAPAGFAILKCKDKEFNGTGPCKNVSTVQCTHGIKPWSTQLLLNGSUVEEEVMlRSENlTN 
NAKN J IVQ LTKPVK1 N CTR P NN NTR KSI R1GPGQAFY ATGD1 1 GDI RQAHCNVS RTEWNETLQKVAKQLR 
KYFNNKTIIFTNSSGGDLEITTHSFNCGGEFFYCNTSGLFNSTWN^^ 
tNMWQRVGOAMYAP PICKSVrt RC ESNITGLU-TRD^^ 

LGVAPTKAKRR WE REKRAVG1 GAVFLGFLGAAGSTMGAAS ITVTVQARQLLSG I VQQQS NLLRA1 EAQQ 
H LLKLTVWGIKOLQAR VLAVER YLKDQ Q LLGIWGCSG KUCTTN VP WN S SWSN KSQ SEI WDN MTWLQ WD K 
EISNYTDI^YNLIEESQNQQEKNEQDLLALDKWANLVV^AVFDISNVVLVWIWFIM1V 

VANRVRQGYSPI^FQTHTPNPGGLORPGRIEEEGGEQGRORSIRLVSGFLALAWDDLRSLCLFSYHRLRD 

FlUAARTVELLGHSSLKGU^LGWEGLKYLWNLLLYWGRELKISAINLLDTlAlAVAGVSrrDRVIEIGQRI 

CRAILNIPRRIRQGLERAU. 



A con. on v (subtype A consensus en v. Identical amino acid sequence to that in the public domain) 
GCCGCCGCCATGCGCGTGATGGGCATCCAGCGCAACTGCCAGCACCTGTG 
GC6CTGGGGCACCATGATCCTGGGCATGATCATCATCTGCTCCGCCGCCG 
AGAACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGACGCC 
GAGACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGACACCGAGGT 
GCAC^ACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 
AGGAGATCAACCTGGAGAACGTGACCGAGGAGTTCAACATGTGGAAGAAC 
AACATGGTGGAGCAGATGCACACCGACATCATCTCCCTGTGGGACCAGTC 
C CTGAAGC CCTGCGTGAAGCTGACCCCCCTGTGC GTGACCCTGAACTGCT 
CCAACGTGAACGTGACCACCAACATCACCAACATCACCGACAACATGAAG 
GGCGAGATCAAGAACrGCTCCTTCAACATGACCACCGAGCTGCGCGACAA 
GAAGCAGAAGGTGTACTCCCTGTTCTACAAGCTGGACGTGGTGCAGATCA 
ACAAGTCCAACTCCTCCTCCCAGTACCGCCTGATCAACTGCAACACCTCC 
GC C ATCACCCAGGCCTGC CCCAAGGTGTC CTTCGAGCCCATGCC CATCCA 
CTACTGCGCC CCCGCCG GCTTCGCC ATCCTGAAGTGCAAGGACAAGGAGT 
TCAACGGCACCGGCCCCTGCAAGAACGTGTCCACCGTGCAGTGCACCCAC 
GGC ATC AAGCCC GTGGTGTCCAC GCAGCTGCTG CTGAACGGCTCCCTG GC 
CGAGGAGGAGGTGATGATCCGCTCCGAGAACATCACCAACAACGCCAAGA 
ACATCATCGTGCAGCTGACCAAGCCCGTGAAGATCAACTGCACCCGCCCC 
AACAACAACACCCGCAAGTCCATCCGCATCGGCCCCGGCCAGGCCTTCTA 
CG C CAC CG GC GACATGATCGGCGACATCC GC CAG GCCCAC TG CAACGTGT 
CCCGCACCGAGTGGAACGAGACCCTGCAGAAGGTGGCCAAGCAGCTGCGC . 
AAGTACTTC AACAAC AAGAC CATCATC TTCAC CAACTCCTC CG GCG GC GA 
CCTGGAGATC ACC AC CCAC TCC TTCAACTGC GGC G GC GAGTTCTTCTACT 
G C AACAC CTC C G GC CTG TTCAACTCC ACC TGGAACGG CAACGG CACC AAG 
AAGAAGAACTCCACCGAGTCCAACGACACCATCACCCTGCCCTGCCGCAT 
CAAGCAGATCATCAACATGTGGCAGCGCGTGGGCCAGGCCATGTACGCCC 
CCCCCATCCAGGGCGTGATCCGCTGCGAGTCCAACATCACCGGCCTGCTG 
CTGACC C GC GAC GG C GG CGACAAC AACTCC AAGAAC GAG ACC TTC C GC C C 
CGGCGGCGGCGACATGCGCGACAACTGGCGCTCCGAGCTGTACAAGTACA 
AGGTG GTGAAGATC GAG CC CCTG GG C GTG GCCC CCAC C AAGGC CAAGC GC 
CGCGTGGTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCGTGTTCCT 
GGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCACCC 
TGACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGTCC 
AACCTGCTGCGCGCCATCGAGGCCCAGCAGCACCTGCTGAAGCTGACCGT 
GTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTGGCCGTGGAGCGCTACC 
TGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGATC 
TGCACCACCAACGTGCCCTGGAACTCCTCCTGGTCCAACAAGTCCCAGTC 
CGAGATCTGGGACAACATGACCTGGCTGCAGTGGGACAAGGAGATCTCCA 
ACTAC AC C GAC ATCATC TACAAC C TGATCGAGGAG TCC CAGAAC CAGCAG 
GAGAAGAACGAGCAGGACCTGCTGGCCCTGGACAAGTGGGCCAACCTGTG 
GAACTG GTTC GACATC TCCAACTGG CTG TG GTACATC AAGATCTTCATCA 
TGATCGTGGGCGGCCTGATCGGCCTGCGCATCGTGTTCGCCGTGCTGTCC 
GTGATCAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACCCA 
CACCCCCAACCCCGGCGGCCTGGACCGCCCCGGCCGCATCGAGGAGGAGG 
GCGGCGAGCAGGGCCGCGACCGCTCCATCCGCCTGGTGTCCGGCTTCCTG 
. GCCCTGGCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCG 
CCTGCGCGACTTCATCCTGATCGCCGCCCGCACCGTGGAGCTGCTGGGCC 
ACTCCTCCCTGAAGGGCCTGCGCCTGGGCTGGGAGGGCCTGAAGTACCTG 
TGGAAC CTGC TGCTGTACTGGGGCCGCGAGCTGAAGATCTCC GCCATCAA 
CCTGCTGGACACCATCGCCATCGCCGTGGCCGGCTGGACCGACCGCGTGA 
TCGAGATCGGCCAGCGCATCTGCCGCGCCATCCTGAACATCCCCCGCCGC 
ATCCGCCAGGGCCTGGAGCGCGCCCTGCTGTAA 




//,/ 



Celllysate Supernatant 

Hussion o< A.con en,geo« la <*» 



M con. gag (group M consensus gag. Identical amino acid sequence to that In the public domain) 

GCCGCCGCCATGGGCGCCCGCGCCTCCGTGCTGTCCGGCGGCAAGCTGGA . 

CGCCTGGGAGAAGATCCGCCTGCGCCCCGGCGGCAAGAAGAAGTACCGCC 

TGAAGCACCTGGTGTGGGCCTCCCGCGAGCTGGAGCGCTTCGCCCTGAAC 

CCCGGCCTGCTGGAGACCTCCGAGGGCTGCAAGCAGATCATCGGCCAGCT 

GCAGCCCGCCCTGCAGACCGGCTCCGAGGAGCTGCGCTCCCTGTACAACA 

CCGTGGCCACCCTGTACTGCGTGCACCAGCGCATCGAGGTGAAGGACACC 

AAGGAGGCCCTGGAGAAGATCGAGGAGGAGCAGAACAAGTCCCAGCAGAA 

GACCCAGCAGGCCGCCGCCGACAAGGGCAACTCCTCCAAGGTGTCCCAGA 

ACTACCCCATCGTGCAGAACCTGCAGGGCCAGATGGTGCACCAGGCCATC 

TCCCCCCGCACCCTGAACGCCTGGGTGAAGGTGATCGAGGAGAAGGCCTT 

CTCCCCCGAGGTGATCCCCATGTTCTCCGCCCTGTCCGAGGGCGCCACCC 

CCCAGGACCTGAACACCATGCTGAACACCGTGGGCGGCCACCAGGCCGCC 

ATGCAGATGCTGAAGGACACCATCAACGAGGAGGCCGCCGAGTGGGACCG 

CCTGCACCCCGTGCACGCCGGCCCCATCCCCCCCGGCCAGATGCGCGAGC 

CCCGCGGCTCCGACATCGCCGGCACCACCTCCACCCTGCAGGAGCAGATC 

GCCTGGATGACCTCCAACCCCCCCATCCCCGTGGGCGAGATCTACAAGCG 

CTGGATCATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCCCCCGTGT 

CCATCCTGGACATCCGCGAGGGCCCCAAGGAGCCCTTCCGCGACTACGTG 

GACCGCTTCTTCAAGACCCTGCGCGCCGAGCAGGCCACCCAGGACGTGAA 

GAACTGGATGACCGACACCCTGCTGGTGCAGAACGCCAACCCCGACTGCA 

AGACCATCCTGAAGGCCCTGGGCCCCGGCGCCACCCTGGAGGAGATGATG 

ACCG CCTGCCAGGGC GTGGGCGGCCC CGGCC ACAAGGC CCGCGTGCTGGC 

CGAGGCCATGTCCCAGGTGACCAACGCCGCCATCATGATGCAGCGCGGCA 

ACTTCAAG GGCCAGCGCC GCATCATCAAGTGCTTCAACTGCGGCAAGGAG 

GGCCACATCGCCCGCAACTGCCGCGCCCCCCGCAAGAAGGGCTGCTGGAA 

GTGCGGCAAGGAGGGCCACCAGATGAAGGACTGCACCGAGCGCCAGGCCA 

ACTTCCTGGGCAAGATCTGGCCCTCCAACAAGGGCCGCCCCGGCAACTTC 

CTGCAGTCCCGCCCCGAGCCCACCGCCCCCCCCGCCGAGTCCTTCGGCTT 

CGGCGAGGAGATCACCCCCTCCCCCAAGCAGGAGCCCAAGGACAAGGAGC 

CCCCCCTGACCTCCCTGAAGTCCCTGTTCGGCAACGACCCCCTGTCCCAG 

TAA 



M.con.pol.nuc 

GCCGCCGCCatgccccagatcaccctgtggcagcgccccctggtgaccat 

caagatcggcggccagctgaaggaggccctgctggccaccggcgccgacg 

acaccgtgctggaggagatcaacctgcccggcaagtggaagcccaagatg 

atcggcggcatcggcggcttcatcaaggtgcgccagtacgaccagatcct 

gatcgagatctgcggcaagaaggccatcggcaccgtgctggtgggcccca 

cccccgtgaacatcatcggccgcaacatgctgacccagatcggctgcacc 

ctgaacttccccatctcccccatcgagaccgtgcccgtgaagctgaagcc 

cggcatggacggccccaaggtgaagcagtggcccctgaccgaggagaaga 

tcaaggccctgaccgagatctgcaccgagatggagaaggagggcaagatc 

tccaagatcggccccgagaacccctacaacacccccatcttcgccatcaa 

gaagaaggactccaccaagtggcgcaagctggtggacttccgcgagctga 

acaagcgcacccaggacttctgggaggtgcagctgggcatcccccacccc 

gccggcctgaagaagaagaagtccgtgaccgtgctggacgtgggcgacgc 

ctacttctccgtgcccctggacgaggacttccgcaagtacaccgccttca 

ccatcccctccatcaacaacgagacccccggcatccgctaccagtacaac 

gtgctgccccagggctggaagggctcccccgccatcttccagtcctccat 

gaccaagatcctggagcccttccgcacccagaacoccgagatcgtgatct 

accagtacatggacgacctgtacgtgggctccgacctggagatcggccag 

caccgcgccaagatcgaggagctgcgcgagcacctgctgcgctggggctt 

caccacccccgacaagaagcaccagaaggagccccccttcctgtggatgg 

gctacgagctgcaccccgacaagtggaccgtgcagcccatccagctgccc 

gagaaggactcctggaccgtgaacgacatccagaagctggtgggcaagct 

gaactgggcctcccagatctaccccggcatcaaggtgaagcagctgtgca 

agctgctgcgcggcgccaaggccctgaccgacatcgtgcccctgaccgag 

gaggccgagctggagctggccgagaaccgcgagatcctgaaggagcccgt 

gcacggcgtgtactacgacccctccaaggacctgatcgccgagatccaga 

agcagggccaggaccagtggacctaccagatctaccaggagcccttcaag 

aacdcaagaccggcaagtacgccaagatgcgctccgcccacaccaacga 

cgtgaagcagctgaccgaggccgtgcagaagatcgccaccgagtccatcg 

tgatctggggcaagacccccaagttccgcctgcccatccagaaggagacc 




tgggagacctggtggaccgagtactggcaggccacctggattcccgagtg 

ggagttcgtgaacaccccccccctggtgaagctgtggtaccagctggaga 

aggagcccatcgccggcgccgagaccttctacgtggacggcgccgccaac 

cgcgagaccaagctgggcaaggccggctacgtgaccgaccgcggccgcca 

gaaggtggtgtccctgaccgagaccaccaaccagaaaaccgagctgcagg 

ccatccacctggccctgcaggactccggctccgaggtgaacatcgtgacc 

gactcccagtacgccctgggcatcatccaggcccagcccgacaagtccga 

gtccgagctggtgaaccagatcatcgagcagctgatcaagaaggagaagg 

tgtacctgtcctgggtgcccgcccacaagggcatcggcggcaacgagcag 

gtggacaagctggtgtccaccggcatccgcaaggtgctgttcctggacgg 

catcgacaaggcccaggaggagcacgagaagtaccactccaactggcgcg 

ccatggcctccgacttcaacctgccccccatcgtggccaaggagatcgtg 

gcctcctgcgacaagtgccagctgaagggcgaggccatgcacggccaggt 

ggactgctcccccggcatctggcagctggactgcacccacctggagggca 

agatcatcctggtggccgtgcacgtggcctccggctacatcgaggccgag 

gtgatccccgccgagaccggccaggagaccgcctacttcatcctgaagct 

ggccggccgctggcccgtgaaggtgatccacaccgacaacggctccaact 

tcacctccgccgccgtgaaggccgcctgctggtgggccggcatccagcag 

gagttcggcatcccctacaacccccagtcccagggcgtggtggagtccat 

gaacaaggagctgaagaagatcatcggccaggtgcgcgaccaggccgagc 

acctcaagaccgccgtgcagatggccgtgttcatccacaacttcaagcgc 

aagggcggcatcggcggctactccgccggcgagcgcatcatcgacatcat 

cgccaccgacatccagaccaaggagctgcagaagcagatcaccaagatcc 

agaacttccgcgtgtactaccgcgactcccgcgaccccatctggaagggc 

cccgccaagctgctgtggaagggcgagggcgccgtggtgatccaggacaa 

ctccgacatcaaggtggtgccccgccgcaaggccaagatcatccgcgact 

acggcaagcagatggccggcgacgactgcgtggccggccgccaggacgag 

gacTAA 



M.con.nef (group M consensus nef. Identical amino acid sequence to that in the public domain) 

gccgccgccatgggcggcaagtggtccaagtcctccatcgtgggctggcc 

cgccgtgcgcgagcgcatccgccgcacccaccccgccgccgagggcgtgg 

gcgccgtgtcccaggacctggacaagcacggcgccatcacctcctccaac 

accgccgccaacaaccccgactgcgcctggctggaggcccaggaggagga 

ggaggaggtgggcttccccgtgcgcccccaggtgcccctgcgccccatga 

cctacaaggccgccctggacctgtcccacttcctgaaggagaagggcggc 

CTGGAGGGC CTGATCTACTCCAAGAAGCGCC AGGAGATCC TGGACCTGTG . 

GGTGTACCACACCCAGGGCTACTTCCCCGACTGGCAGAACTACACCCCCG 

GCCC CGGC ATCCGCTACCCC CTGAC CTTCG GCTGGTGCTTC AAGCTGGTG 

CCCGTGGACCCCGAGGAGGTGGAGGAGGCCAACGAGGGCGAGAACAACTC 

CCTGCTGCACCCCATGTGCCAGCACGGCATGGAGGACGAGGAGCGCGAGG 

TGCTGATGTGGAAGTTCGACTCCCGCCTGGCCCTGCGCCACATCGCCCGC 

G AGCTGC AC C CCGAGTACTAC AAGGACTGCTAA 



C.con.pol.nuc 

GCCGCCGCCatgccccagatcaccctgtggcagcgccccctggtgtccat 

caaggtgggcggccagatcaaggaggccctgctggccacoggcgccgacg 

acaccgtgctggaggagatcaacctgcccggcaagtggaagcccaagatg 

atcggcggcatcggcggcttcatcaaggtgcgccagtacgaccagatcct 

gatcgagatctgcggcaagaaggccatcggcaccgtgctggtgggcccca 

cccccgtgaacatcatcggccgcaacatgctgacccagctgggctgcacc 

ctgaacttcc^^trtcccccatcgagaccgtgcccgtgaagctgaagcc 

cggcatggacggccccaaggtgaagcagtggcccctgaccgaggagaaga 

tcaaggccctgaccgccatctgcgaggagatggagaaggagggcaagatc 

accaagatcggccccgagaacccctacaacacccccgtgttcgccatcaa 

gaagaaggactccaccaagtggcgcaagctggtggacttccgcgagctga 

acaagcgcacccaggacttctgggaggtgcagctgggcatcccccacccc 

gccggcctgaagaagaagaagtccgtgaccgtgctggacgtgggcgacgc 

rtacttctccgtgcccctggacgagggcttccgcaagtacaccgccttca 

ccatcccctccatcaacaacgagacccccggcatccgctaccagtacaac 

gtgctgccccagggctggaagggctcccccgccatcttccagtcctccat 

gaccaagatcctggagcccttccgcgcccagaaccccgagatcgtgatct 

accagtacatggacgacctgtacgtgggctccgacctggagatcggccag 

caccgcgccaagatcgaggagctgcgcgagcacctgctgaagtggggctt 

caccacccccgacaagaagcaccagaaggagccccccttcctgtggatgg 

gctacgagctgcaccccgacaagtggaccgtgcagcccatccagctgccc 

gagaaggactcctggaccgtgaacgacatccagaagctggtgggcaagct 

gaactgggcctcccagatctaccccggcatcaaggtgcgccagctgtgca 

agctgctgcgcggcgccaaggccctgaccgacatcgtgcccctgaccgag 

gaggccgagctggagctggccgagaaccgcgagatcctgaaggagcccgt 

gcacggcgtgtactacgacccctccaaggacctgatcgccgagatccaga 

agcagggccacgaccagtggacctaccagatctaccaggagcccttcaag 

aacctcaagaccggcaagtacgccaagatgcgcaccgcccacaccaacga 

cgtgaagcagctgaccgaggccgtgcagaagatcgccatggagtccatcg 

tgatctggggcaagacccccaagttccgcctgcccatccagaaggagacc 

tgggagacctggtggaccgactactggcaggccacctggattcccgagtg 

ggagttcgtgaacaccccccccctggtgaagctgtggtaccagctggaga 

aggagcccatcgccggcgccgagaccttclacgtggacggcgccgccaac 

cgcgagaccaagatcggcaaggccggctacgtgaccgaccgcggccgcx;a 

gaagatcgtgtccctgaccgagaccaccaaccagaaaaccgagctgcagg 

ccatccagctggccctgcaggactccggctccgaggtgaacatcgtgacc 

gactcccagtacgccctgggcatcatccaggcccagcccgacaagtccga 

gtccgagctggtgaaccagatcatcgagcagctgatcaagaaggagcgcg 

tgtacctgtcctgggtgcccgcccacaagggcatcggcggcaacgagcag 

gtggacaagctggtgtcctccggcatccgcaaggtgctgttcctggacgg 

catcgacaaggcccaggaggagcacgagaagtaccactccaactggcgcg 

ccatggcctccgagttcaacctgccccccatcgtggccaaggagatcgtg 

gcctcctgcgacaagtgccagctgaagggcgaggccatgcacggccaggt 

ggactgctcccccggcatctggcagctggactgcacccacctggagggca 

agatcatcctggtggccgtgcacgtggcctccggctacatcgaggccgag 



gtgatccccgccgagaccggccaggagaccgcctacttcatcctgaagct 

ggccggccgctggcccgtgaaggtgatccacaccgacaacggctccaact 

tcacctccgccgccgtgaaggccgcctgctggtgggccggcatccagcag 

gagttcggcatcccctacaacccccagtcccagggcgtggtggagtccat 

gaacaaggagctgaagaagatcatcggccaggtgcgcgaccaggccgagc 

acctcaagaccgccgtgcagatggccgtgttcatccacaacttcaagcgc 

aagggcggcatcggcggctactccgccggcgagcgcatcatcgacatcat 

cgccaccgacatccagaccaaggagctgcagaagcagatcatcaagatcc 

agaacttccgcgtgtactaccgcgactcccgcgaccccatctggaagggc 

cccgccaagctgctgtggaagggcgagggcgccgtggtgatccaggacaa 

ctccgacatcaaggtggtgccccgccgcaaggccaagatcatcaaggact 

acggcaagcagatggccggcgccgactgcgtggccggccgccaggacgag 

gacTAA 




M. con. gag (group M consensus gag) 

MGARASVLSGGKLDAWEK1RLRPGGKKKYRLKHUVWASRELERFALNPGLLETSEGCKQUGQLQPA 

LQTGSEELRSLYNTVATLYCVHQRlEVKOTKEALEKIEEEQNKSQQKTQQAAADKGNSSKVSQNYPIVQr4 

LQGQMVHQAISPRTLNAWVKVIEEKAFSPEVIPMFSALSEGATPQDLNTMLNTV^ 

EAAEV^RLHP\^GPIPPGQMREPRGSDlAGTTSTLQEQlA\An^SNPP!PVGEtYKRWllLGLNKIVRMY 

SPVSILD1RQGPKEPFRDY\^RFFKTIJWEC^TQDVKNWMTDTLLVQNANPDCKTILKALGPGATLEEMM 

TACQGVGGPGHKARVUKEAMSQVTNAAIMMQRGNFKGQRRIIKCFNCGKEGHIARNCRAPRKKGCVWCG^ 

EGHQMKOCTER^ 
SLFGNDPLSQ 



M.con.pot (group M consensus pot) 

MPQlTLWQRPLVTIWGGQLKEALLaTGADDTVLEEINIPGKWKP 

KAIGTVLVGPTPVNIIGRNMLTQIGCTLNFPISPIETVPVKLI^GMDG^ 

MEKEGKlSKtGPENPY^ITPIFAJKKKDSTKWRKLVDFRELNKRTQDF^^ 

VGDAYFSVPLOEOFRKYTAniPSINNETPGlRYQYNVLPQGWKGSPAIFQSSMTKILEPFRTQNPEIVl 
YQYMDDLWGSDLEIGQHRAKIEEU?EHLIJ^WGFTrPDKKHQKEPPFLVVMGYEmPDKWTVQPlQLPEKD 
SWTVNDI QKLVG KLNWASQI YPG I KVKQLCKLLRGAKALTOIV^LTE EAE L ELAE N R ElUCEPVHG VYYD 
PSKOUAEIQKQGQOQWTYQIYQEPFKNlXTGKYAK^^SAHTND^^QLTEAVQKIATESlV^WGKTFKFR 
LP I Q KETWETV\Mn*EYWQATWIPEWEFVffTPP LVKLWYQLEKE P IAGAETFYVDGAANRETKLGKAGYVTD 
RGRQKWSLTETTNQiaELQAIHLALQDSGSEVNIVTDS^^^ 

VYLSWVPAHKGfGGNEQVDKLVSTGIRKVLFLOGIDKAQEEHEKYHSNVVRAMASDFNLPPIVAKEIVASC 
DKCQLKGEAM HGQVDC SP Gl WQLDCTHLEG K11 LVAVHVASGYI EAEVIPAETGQ ETAYF I LKLAGR WPV 
KVJHTDNGSNFTSAAVKAACWWAGIQQEFGJPYNPQSQGWESMNKELKKJIGQVRDQAEHLXTAVQMAV 
FIHNFKRKGGIGGYSAGERHOIIATOIQTKELQKQITKIQNFRVYYRDSRDPIVWGPAKLLWKGEGAW 
IQONSDIKWPRRKAKIIRDYGKQMAGOOCVAGRQDED 




WRI^VPLRPMTYKAALDLSHFLKEKGGLEGUYSWWQEIIiLVWVYKt^ 

GWCFKLVPWPEEVEEANEGENNSLLHPMCQHGMEDEEREVLMWKFOSRUVLRHIARELHPEYYKDC 



C. 0^t_ VS. t\ 



C.con.poi (subtype C consensus pot) 

MPQITLWQRPLVSlKVGGQIKEALLaTGADDTVLEElNLPGKWKPKMIGGIGGFIKVRQYDQILIEICGK 

KA1GTVLVGPTPVN 1 1 GR N MLTQLGCTLN FPI S PI ETVPVKLKPG MDGPKVKQ WPLTEEKI KALTA1 CEE 

MEKEGKITKIGPENPYNTPVFAIKKKOSTKWRKLVOFRELNKRTQDFWEVQUGIPHPAGLKKKKSVTVLD 

VGDAYFSVPLDEGFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRAQNPEIVI 

YQYMDDLWGSDLEIGQHRAKlEELREHUKWGFTTPDKKHQKEPPFUWMGYELHPDKVSrrVQPJQLPEKD 

SVVTTVNDIQKLVGKLNWASQIYPGIKVRQLCKLUIGAKALTDJVPLTEEAELEIAENREILKEPVHGVYY^ 

PSKDLIAEIQKQGHDQV\nYQIYQEPFKNLKTGKYAKMRTAHTNDVKQLTE^ 

LPIQKETWETTWWTDYWQATWIPEWEFVNTPPLVK^ 

RGRQKIVSLTETTNQKTELQAIQLALQDSGSEVNIVTOSQYAl.GHQAQPDKSESELVNQnEQUKKER 
VYLSWVPAHKGIGGNEQVOKLVSSGIRKVIJLOGIOKAQEEHEKYHSNWRAMASEFNLPPIVAKEIVASC 
O KCQ LKG EAM HG Q VDC S PG I WQLDCTHLEG WllVAVHVASG YIEAEVt PAETGQ ETA YFfLKLAGR WP V 

KV) HTDN GSN FTSAAVKAACWVVAGIQQ EFGI P YNPQSQGWE ^ M ^^^J^^^Y|JP^^y y^^f?*' 1 ^ 
FIHNFKRKGGIGGYSAGERI»DIIATDIQTKELQKQ11K1QNFRVYYR0SRDPJWKGPAKLLWKGEGAW 

I Q DNSD1 KWP RRKAKI J KD YG KQMAGADCVAGR Q DED 



■Mcon.gaQ (subtype B consensus gag. The amino acid sequence is different from Los Alamos Database August 2002) 

!gccgccgccatgggcgcccgcgcctccgtgctgtccggcggcgagctgga 

ccgctgggagaagatccgcctgcgccccggcggcaagaagaagtacaagc 

tgaagcacatcgtgtgggcctcccgcgagctggagcgcttcgccgtgaac 

cccggcctgctggagacctccgagggctgccgccagatcctgggccagct 

gcagccctccctgcagaccggctccgaggagctgcgctccctgtacaaca 

ccgtggccaccctgtactgcgtgcaccagcgcatcgaggtgaaggacacc 

aaggaggccctggagaagatcgaggaggagcagaacaagtccaagaagaa 

GGCCCAGCAGGCCGCCGCCGACACCGGCAACTCCTCCCAGGTGTCCCAGA 

actaccccatcgtgcagaaoctgcagggccagatggtgcaccaggccatc 
tccccccgcaccctgaacgcctgggtgaaggtggtggaggagaaggcctt 
ctcccccgaggtgatccccatgttctccgccctgtccgagggcgccaccc 
cccaggacctgaacaccatgctgaacaccgtgggcggccaccaggccgcc 

ATGCAGATGCTGAAGGAGACCATCAACGAGGAGGCCGCCGAGTGGGACCG 

CCTGCACCCCGTGCACGCCGGCCCCATCGCCCCCGGCCAGATGCGCGAGC 

CCCGCGGCTCCGACATCGCCGGCACCACCTCCACCCTGCAGGAGCAGATC 

GGCTGGATGACCAACAACCCCCCCATCCCCGTGGGCGAGATCTACAAGCG 

CTG GATCATCCTGGGCCTGAACAAGATCGTGCG CATGTACTCCCCCACCT . 

CCATCCTGGACATCCGCCAGGGCCCCAAGGAGCCCTTCCGCGACTACGTG 

GACCGCTTCTACAAGACCCTGCGCGCCGAGCAGGCCTCCCAGGAGGTGAA 

GAACTGGATGACCGAGACCCTGCTG GTGCAGAACGC C AACCC CGACTGCA 

AGACCATCCTGAAGGCCCTGGGCCCCGCCGCCACCCTGGAGGAGATGATG 

ACCGCCTGCCAGGGCGTGGGCGGCCCCGGCCACAAGGCCCGCGTGCTGGC 

CGAGGCCATGTCCCAGGTGACCAACTCCGCCACCATCATGATGCAGCGCG 

G CAACTTC CG CAAC C AGC GCAAGAC CGTGAAGTGCTTCAACTGC G G C AAG 

GAGG G C C AC ATCGC CAAGAACTGCCGCGCCC CCCGC AAGAAGG GCTGCTG 

GAAGTGCGG CAAGGAGGGCCACCAGATGAAGGACTGCACCGAGC GC CAGG 

CCAACTTCCTGGGCAAGATCTGGCCCTCCCACAAGGGCCGCCCCGGCAAC 

TTCCTGCAGTCCCGCCCCGAGCCCACCGCCCCCCCCGAGGAGTCCTTCCG 

CTTCGGCGAGGAGACCACCACCCCCTCCCAGAAGCAGGAGCCCATCGACA 

AGGAGCTGTACCCCCTGGCCTCCCTGCGCTCCCTGTTCGGCAACGACCCC 

TCCTCCCAGTAA 



B con env (subtype B consensus env. The amino acid sequence is different from Los Alamos Database August 2002) 

GCCGCCGCCATGCGCGTGAAGGGCATCCGCAAGAACTACCAGCACCTGTG 

GCGCTGGG GC ACCATGCTGCTGGGCATGCTGATGATCTGCTCCGC CGCCG 

AGAAGCTGTGGGTGACCG7GTACTAC GGC GTGCCC GTGTGGAAGGAGGCC 

ACCACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGACACCGAGGT 

GCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 

AGGAGGTGGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAAC 

AACATGGTGGAGCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTC 

CCTGAAGCCCTGC GTGAAG CTGACCCCC CTGTGCGTGACCCTGAACTGCA 

CCGACCTGAAGAACAACCTGCTGAACACCAACTCCTCCTCCGGCGAGAAG 

ATGGAGAAGGGCGAGATCAAGAACTGCTCCTTCAACATCACCACCTCCAT 

CCGCGACAAGGTGCAGAAGGAGTACGCCCTGT7CTACAAGCTGGACGTGG 

TGCCCATCGACAACAACAACAACACCTCCTACCGCCTGATCTCCTGCAAC 

ACCTCCGTGATCACCCAGGCCTGCCCCAAGGTGTCCTTCGAGCCGATCCC 

CATCCACTACTGCGCCCCCGCCGGCTTCGCCATCCTGAAGTGCAACGACA 

AGAAGTTCAACGGCACCGGCCCCTGCACCAACGTGTCCACCGTGCAGTGC 

ACCCACGGCATCCGCCCCGTGGTGTCCACCCAGCTGCTGCTGAACGGCTC 

CCTGGCCGAGGAGGAGGTGGTGATCCGCTCCGAGAACTTCACCGACAACG 

CCAAGACCATCATCGTGCAGCTGAACGAGTCCGTGGAGATCAACTGCACC 

CGCCCCAACAAGAACACCCGCAAGTCCATCCACATCGGCCCCGGCCGCGC 

CTTCTACACC ACCGG CGAGATGATCGGCGACATCCGC CAGGCCCACTGCA 

ACATCTC CCGCGCCAAGTG GAACAACAC CCTGAAGGAGATCGTGAAGAAG 

CTGCGCGAG CAGTTCGGCAACAAGACCATCGTGTTCAAC CAGTCCTCCGG 

CGGCGACC C CGAGATCGTGATGCACTCCTTCAACTGC GGCG GCGAGTTCT 

TCTACTGCAACACCACCCAGCTGTTCAACTCCACCTGGAACGACAACGGC 

ACCTGGAACAACACCAAGGACAAGAACACCATCACCCTGCCCTGCCGCAT 

CAAGCAGATCATCAACATGTGGCAGGAGGTGGGCAAGGCCATGTACGCCC 

CCCCCATCCGCGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTGCTG 

CTGACCCGCGACGGCGGCAACAACAACAACGACACCGAGATCTTCCGCCC 

CG GCGG CGGCGACATGCGCGACAACTGG CGCTCCGAG CTGT ACAAGTAC A 
AGGTGGTGAAGATCGAGCCCCTGGGCGTGGCCCCCACCAAGGCCAAGCGC 

CGCGTGGTGCAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCATGTTCCT 

GGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATGACCC 

TGACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGAAC 

AACCTGCTGCGCGCCATCGAGGCCCAGCAGCACCTGCTGCAGCTGACCGT 

GTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTGGCCGTGGAGCGCTACC 

TGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGATC 

TGCACCACCACCGTGCCCTGGAACGCCTCCTGGTCGAACAAGTCCC7GGA 

CGAGATCTG GGACAACATGAC CTGGATG GAGTGGGAGCGC GAGATCGAC A 

ACTACACCTCCCTGATCTACACCCTGATCGAGGAGTCCCAGAACCAGCAG 

GAGAAGAACGAGCAGGAGCTGCTGGAGCTGGACAAGTGGGCCTCCCTGTG 

GAACTGGTTCGACATCACCAACTGGCTGTGGTACATCAAGATCTTCATCA 

TGATCGTGGGCGGCCTGATCGGCCTGCGCATCGTGTTCGCCGTGCTGTCC 

ATCGTGAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACCCG 

CCTGCCCGCCCCCCGCGGCCCCGACCGCCCCGAGGGCATCGAGGAGGAGG 

GCGGCGAGCGCGACCGCGACCGCTCCGGCCGCCTGGTGGACGGCTTCCTG 

GCCCTGATCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCG 

CCTGCGCGACCTGCTGCTGATCGTGACCCGCATCGTGGAGCTGCTGGGCC 

GCCGCGGCTGGGAGGTGCTGAAGTACTGGTGGAACCTGCTGCAGTACTGG 

TCCCAGGAGCTGAAGAACTCCGCCGTGTCCCTGCTGAACGCCACCGCCAT 

CGCCGTGGCCGAGGGCACCGACCGCGTGATCGAGGTGGTGCAGCGCGCCT 

GCCGCGCCATCCTGCACATCCCCCGCCGCATCCGCCAGGGCCTGGAGCGC 

GCCCTGCTGTAA 



MGARASVLSGGELDR WE K) RLR P GGKKKYKL KHI VWASRE LE RFAVN P GLLETSEGC R Q1LGQLQP SLOT 
GSEELRSLYNTVATLYCVHQRIEVKDTKEAUEKIEEEQNKSKKKAQQAAADTGNSSQVSQNYPIVQNLQG 
QMVKCWISPRTLI^WV/KWEEKAFSPEVIPMFS^ 

EWDRLHPWAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTMNPPrPVGEtYKRVVIlLGLNKIVRMYSPT 
SILDlRQGPKEPFRDYVDRFYKTLPJkEC^SQEVKNWMTET^^ 

QGVGGPGHKARVI^EAMSQVTNSATINAMQRGNFRNQRKTVKCFNCG KEGHIA KNCRAPRKKGCWKCGKEG 

HQMKDCTERQANFLGKIVN^SHKGRPGNFLQSRPEPTAPPEESFRFGEETTTPSQKQEPIDKELYPLASLR 

SLFGNDPSSO 



D.mn.onv tsuuiTuo o Minociiaws 

M R VKGIR KNYQ H LWR WGTM LLGM I Ml CSAAEKL VW7VYYG VPVWKEATT7l.FCASDAK^ 
HACVPTOPNPQEVN^ENVTENFNMWKNNMVEQMHEOIISLWDQSL^ 

NSSSGEKMEKGEIKNCSFNITTSIRDKVQKEYALFYKLDNA^IONNN^SYRUSCWSVITQACPKVSF 

EP1PIHYCAPAGFAILKCNDKKFNGTGPCTNVSWQCTHGIRPWSTQLLLNGSLAEEEWIRSENFTDN 

AKTllVQLNESVElNCTRPNNfrTRKSiHIGPGRAFYTTGEllGDIRQAHCNlSRAKWNNTLKQIVKKLRE 

QFGNKTr\/FNQSSGGDPEIVMHSFNCGGEFFYCNTTQLFNSTVS/NDNGTWNNTKDKNTITLPCRIKQIINM 

WQEVGKAMYAPPIRGQIRCSSNITGLLLTRDGGNNNNDTBFRPGGGDMRDMWRSELYKYKWKIEPLGV 

APTKAKRRWQREKRAVGIGAMFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLL 

QLTNWGIKaiC^RVLAVERYLKDQQLLGIWGCSGKUCTTTW^ 

NYTSU YTLl E ESQ NQG EKN EQE LLELDKWAS LWNWFOITN WLWYI KIF IM 1 VGGLIG LR I VFAVLS I VN 
RVRQGYSPLSFQTRLPAPRGPDRPEGIEEEGGERORDRSGRLVDGFLALIWDOLRSLCLFSYHRLRDLLL 
iV/TRI VELUGRRG WEVLKYWWN LLQ YWS Q ELKN SAVS LLN ATA1 AVAEGTDRVIEWO RAC RAI L H I P RR 
IRQGLERALL 
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